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ALLERGENIC PROTEINS AND PEPTIDES FROM DOG DANDER 

AND USES THEREFOR 



5 Fackfrrnui^ ftf th* Invention 

Approximately 10% of the population become hypersensitized (allergic) upon 
exposure to antigens from a variety of environmental sources. Those antigens that induce 
immediate and/or delayed types of hypersensitivity are known as allergens (King, T. P., 
(1976) a Immunol.. 22: 77 - 105. These include products of grasses, trees, weeds, animal 

10 dander, insects, food, drugs, and chemicals. Genetic predisposition of an individual is 

believed to play a role in the development of immediate allergenic responses (Young, R. P. si 
al, (1 990) Clin. Sci. . 22: 1 9a) such as atopy and anaphylaxis whose symptoms include hay 
fever, asthma, and hives. 

The antibodies involved in atopic allergy belong primarily to the IgE class of 

1 5 immunoglobulins. IgE binds to basophils, mast cells and dendritic cells via a specific, high- 
affinity receptor FceRI (Kinet, J.P., (1990) Curr. Ppin. Immunol, 2: 499 - 505). Upon 
combination of an allergen acting as a ligand with its cognate receptor IgE, FceRI bound to 
the IgE may be cross-linked on the cell surface, resulting in physiological manifestations of 
the IgE - allergen interaction. These physiological effects include the release of, among other 

20 substances, histamine, serotonin, heparin, chemotactic factors) for eosinophilic leukocytes 
and/or leukotrienes C4, D4, and E4, which cause prolonged constriction of bronchial smooth 
muscle cells (Hood, L. E. £LaL, Immunology (2nd ed.), The Benjaniin/Qimming Publishing 
Co., Inc. (1984)). Hence, the ultimate consequence of the interaction of an allergen with IgE 
are allergic symptoms triggered by release of the aforementioned mediators. Such symptoms 

25 may be systemic or local in nature, depending on the route of entry of the antigen and the 

pattern of deposition of IgE on mast cells or basophils. Local manifestations generally occur 
on epithelial surfaces at the site of entry of the allergen. Systemic effects can induce 
anaphylaxis (anaphylactic shock) which results from IgE-basophil response to circulating 
(intravascular) antigen. 

30 The pet dog (Canis familiaris) is kept in households the world over. In houses and 

public schools where dogs have been kept on a regular basis, dog dander allergens can be 
detected in dust samples (Wood, R. A. el aL, (1988) Am Rev Respir. D>s„ 122: 358 - 363, 
and Dybendal, T. sUlL, (1989) Allergy. 44: 401 - 4,11). The prevalence of allergy to dogs as 
assessed by skin prick test is approximately 15% (Haahtela, T. £L&L (198 1) AUeigy., 26: 25 1 

35 - 256, and de Groot, H. etal.. (1991) L Allergy Clin. Immunol., £2: 1056 - 1065). In one 
study, sensitivity to dog allergen(s) was detected in 40% of asthmatic children, even though 
dogs were not kept as pets in their homes (Vanto, T. and Koivikko, A., (1983) Acta Paedjatx 
Scand. . 22: 571 - 575). 
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Treatment of patients with dog allergy by administration of dog dander extracts has 
not proven to be as efficacious as treatment of cat allergic patients with cat dander extracts 
(Hedlin, G. sUU1991) ,T AllfTPV Hin Immunol., £2: 955 - 964). As with any 
desensitization scheme involving injection of increasing doses of allergen(s), there are the 
5 drawbacks of potential anaphylaxis during treatment, and the possible necessity of continuing 
therapy over aperiod of several years to build up sufficient tolerance that results in 
significant diminution of clinical symptoms. 

Dog hair and dander extracts are complex mixtures containing a number of allergenic 
proteins. (Loewenstein, H glflL (1982) frpr-cedinps 1 1th Intf mntipnnl ffinffraw-pf 
10 -aiHn»in pv nnd n hr~i Ttnmiinologv .London, pp 545-548; Uchlin, T elaL, (1984) 

A n ergv , 22: 125 - 133; Yman, L. fiUL, (1984) Int Arch AlHrv Appl, Iimrnmol.U. M: 358 - 
368; Spitzauer, S. elaL (1993) Tnf Arch Allerpv Immunol,, 100: 60 - 67). Two allergens 
present in dog hair/dander have been purified using immunoaffinity chromatography. A 
major allergen from dog, £an_fl (Nomenclature according to the criteria of the IUIS (Marsh, 
15 D. CI fiUL, (1988) ™n Allergy, li: 201 - 209; Agl3 according to original nomenclature), 
has been partially purified by two groups (Schou, C. eLaL, (1991) Hin and Fxp. AllergV, 21 : 
321 -328 and de Groot sUL, supra). Both groups, partially purified £anf I was established 
as an allergen by CRIE analysis (Ford A. W. sUL, (1 989) Hin , film . Allergy, IS: 1 83 - 190), 
and then rabbits or Balb/b mice were immunized to obtain polyclonal or monoclonal 
20 antibodies against the allergen, respectively. Iirrniunoaffinity purified £an_f I (~25 kD in 
molecular weight, with a minor component -18 kD) which elicited a high frequency of 
positive skin prick tests among dog allergic patients was able to deplete 50 - 70% of IgE 
binding to dog dander extracts in RAST (radioallergosorbent test) analysis. While de Groot 
eta l. did not attempt to determine any amino acid sequence of CaD-f I, Schou sUL found the 
25 amino terminus of their immunoaffinity purified Canil was blocked. Hence, no amino acid 
sequence of Canf I is presently in the public domain. 

The presence of a second (minor) allergen in dog extract was detected by binding of 
IgE antibodies to dog dander/hair extracts by several groups (de Groot e±aL, Sipja, Schou, C. 
etal. . supra and Spitzauer sLaL, (1993) Tnf, Arch. Allergy Immunol 60 - 67). The 
30 molecular weight of a minor allergen was reported to be 18kD(SchoueLaL,siipm), 19 kD 
(Spitzauer fiLaL, sups) and 27 kd (de Groot eUsL, supra). It is difficult however to correlate 
these results since only one group (de Groot eLaL, sypra) affinity purified an allergen 
designated Cjffllll (originally named Dog 2 allergen). CanJII was purified from dog dander 
extracts in a manner analogous to £aD_f I using monoclonal antibodies generated to a second 
35 allergen present in extracts (de Groot sUL, supra). Molecular weight of £ani II reported by 
this group as -27 kD was later verified to be - 24 kD (Aalberse, R.C. personal 
communication). Purified £anf II allergen was found to react with IgE of only 66% of dog 
allergic patients. In RAST analysis, Cjmlll allergen was able to compete with 23% of the 
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IgE directed against dog dander extract. The amino acid sequence of £sn_f II has not been 
previously determined. 

Many patients with sensitivity to dog dander allergens are treated currently by 
administration of small, gradually increasing doses of dog dander extracts. Use of these 
5 extracts has multiple drawbacks, including potential anaphylaxis during treatment and the 
necessity of continuing therapy, often for a period of several years to build up sufficient 
tolerance and significant diminution of clinical symptoms. The ability to substitute 
compositions of at least the major dog dander allergens, such as Can fl and £ml II, would 
overcome several of these drawbacks. Thus, a source of pure allergen that could be provided 
10 in quantity for use as a diagnostic or tiierapeutic reagent and therapeutic methods that would 
overcome the drawbacks associated with dog dander extracts are highly desirable. 

Summary of the Invention 

This invention provides isolated nucleic acids encoding peptides having at least one 

1 5 biological activity of Can f I or £an_f II, protein allergens of the species Canis familiar is. 
Preferred nucleic acids are cDNAs having a nucleotide sequence shown in Figure 5 (SEQ ID 
NO: l)(Caaf I) and Figure 1 8 (SEQ ID NO: 67) (£an_f II), The invention also pertains to 
peptides encoded by all or a portion of such cDNAs (SEQ ID NO: 1 and SEQ ID NO: 67) and 
having at least one biological activity of £an_f I or £an_f II. Also contemplated are isolated 

20 nucleic acids which hybridize under high stringency conditions (e.g., equivalent to 20-27°C 
below Tm and IM NaCl ) to a nucleic acid having a nucleotide sequence shown in Figure 5 
(SEQ ID NO: 1) or Figure 18 (SEQ ID NO: 67) or which encodes a peptide comprising all or 
a portion of an amino acid sequence of Figure 5 (SEQ ID NO: 2)(£an_f I) or Figure 1 8 (SEQ 
ID NO: 68) (Canf II). Nucleic acids which encode peptides having an activity of £an_f I or 

25 Can f II and having at least 50% homology with a sequence shown in Figure 5 (SEQ ID NO: 
2)(Can_f I) or Figure 18 (SEQ ID NO: 68)(£an_f II) are also featured. Peptides having a Can 
f I or Can f II activity produced by recombinant expression of a nucleic acid of the invention, 
and peptides having a Can f I or £an_f H activity prepared by chemical synthesis are also 
featured by this invention. Preferred peptides have the ability to induce a T cell response, 

30 which may include T cell stimulation (measured by, for example, T cell proliferation or 

cytokine secretion) or T cell nonresponsiveness (i.e., contact with the peptide or a complex of 
the peptide with an MHC molecule of an antigen presenting cell induces the T cell to become 
unresponsive to stimulatory signals or incapable of proliferation). Other preferred peptides, 
either apart from or in addition to the ability to induce a T cell response, have the ability to 

35 bind the dog dander specific IgE of dog dander-allergic subjects. Such peptides are useful in 
diagnosing sensitivity to dog dander in a subject Still other peptides, either apart from or in 
addition to the ability to induce a T cell response, have a significantly reduced or negligible 
ability to bind dog dander-allergic IgE. Such peptides are particularly useful as therapeutic 
agents. 
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Other preferred peptides comprise an amino acid sequence shown in Figure 5 (SEQ 
ID NO- 2) (Canll) or Figure 18 (SEQ ID NO: 68)(£aof II). In one embodiment, peptides 
having a£anf IorCsnfH activity and comprising a portion of the amino acid sequence of 
Figure 5 (SEQ ID NO: 2) or Figure 18 (SEQ ID NO: 68) are at least about 8-30 amino adds 
5 in length, preferably about 10-20 amino acids in length, and most preferably about 10-16 

amino acids in length. 

Another aspect of the invention features antibodies specifically reactive with a 
* peptidehavingaCfflLflorCaafllactivity. A peptide having an activity of £an_f I or CajLf 
II can be used in compositions suitable for phanrtaceutical adjninistration. For example, such ' 

10 composition^^ 

allergic reactions to dog dander in a subject Nucleic acids of the invention and peptides 

having an activity of Canfl or Caaf II can be also used for diagnosing sensitivity m a 

subject to a dog dander. 

15 ftpefp e ^riptiw ofthf Drawings - _ 

Figure 1 shows degenerate primer pairs based on residues 9-15 and 30-37 of the 

rr^tureCanllr^teinus^^^^ Twomternal 
degenerate oligonucleotide probes based on £anl I protein residues 1 7-24 (Dog Probe 1 ) and 
88-94 (Dog Probe 2) are shown. , 

Figure 2 shows oligonucleotide primers used in a RACE PCR protocol to obtain the 
3' portion of the Canil cDNA. A degenerate oligonucleotide probe (Dog Probe 4) is also 

ShOWI1 ' Figure 3 shows primers used in an anchored PC* technique to determine the 5" end of 
the £an_f I cDN A. A degenerate oligonucleotide probe, Dog Probe 0, based on residues 9-17 

25 nf the Can f protein is shown. . 

Figure 4 is a schematicrepresentation of a PCR sequencing strategy used to obtain 
the sequence of the mature Cmil protein from both strands of amplified cDN A. 

Figure 5 is the cDNA sequence and deduced amino acid sequence of C_an_f I. 
Figure 6 is a schematic representation of the strategy used to express CjuiII 

30 recombinant protein in bacteria. 

Figure 7 is a schematic representation of the strategy used to express Can! I 
recombinant protein in a mammalian cell using the P J7L expression vector. 

Figure 8 is a schematic representation of the strategy used to insert a His6 reporter 
group at the carboxy terminus of the recombinant ^ I protein to aid purification of the 

35 protein. ' rT 

Figure 9 shows the alignment of three partial 3' Qmi I cDNA sequences (Canll, 
2Can fI and 3£an_fl)- An (*) indicates that the position in the alignment is perfectly 
conserved and a (.) indicates that the position is well conserved. A (-) was inserted where 
necessary for purposes of alignment. 
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Figure 10 is a graphic representation depicting the response of T cell lines from 
patients primed in vitro with recombinant Canf l (i£aiif I) and analyzed for response to 
rCanf l and various peptides derived from Canfl by positivity index (% of patients who 
positively responded multiplied by the mean stimulation index). 
5 Figure 1 1 is a graphic representation of a direct binding assay of IgE from a single 

dog allergic patient to bacterially expressed recombinant Qsilf I • 

Figure 12 shows Western blot analysis of four protein preparations (Lane 1 : dog hair 
extract; Lane 2: dog saliva; Lane 3: bacterially expressed recombinant Canf I ; and Lane 4: 
recombinant Canf I expressed in a mammalian cell culture system) probed with plasma from 
10 a dog allergic patient (#901) or with plasma from a negative control patient (#250). 

Figure 1 3 shows the design of primers based on a partial amino acid sequence of 
mature Canf II and the sequence strategy for QmS II. ( ) denotes residues which were not 
determined. 

Figure 14 is a schematic representation of the strategy used to clone Canf H cDNA. 
1 5 Figure 1 5 is a schematic representation of the strategy used to clone the 5' (A) and 3* 

(B) portions of CaiLf II cDNA flanking the sequence encoding a portion of the amino acid 
sequence (shaded) of native Caaf II- 

Figure 16 is the nucleotide sequence of primers used in cloning Canf II cDNA. 
Figure 17 shows the sequence strategy used to determine the nucleotide sequence of 
20 the Can f II cDNA clones la, lc and lj. The figure depicts inserts of the cDNA clones la 
(793 bp), lc (791 bp) and lj (774 bp). The hatched bars represent coding sequence. The 
triangles indicate the position of an initiator methionine codon (ATG); the codon specifying 
the N-terminal amino acid residue of the mature protein (START); the position of a 
termination codon (STOP); and the position of a polyadenylation signal (As). The arrows 
25 indicate the extent and direction of the sequencing reactions. 

Figure 18 is the cDNA sequence and deduced amino acid sequence (from Clone lc) 

of Canf II • - 

Figure 19 is a comparison of the deduced amino acid sequence of CaaJ II based on 
the cDNA sequence of clone lc and a portion of the native Qml II determined by protein 
30 sequencing of the N-teiminus. The amino acid residues of a signal peptide are numbered -1 9 
to-1. 

Figure 20 shows northern analysis of mRN A of different dog tissues. Total cellular 
RNA (25 mg) from dog tongue epithelial tissue, parotid salivary gland, skin, mandibular and 
submaxiliary glands, liver and spleen was subjected to Northern analysis using £aaf II 
35 cDNA as a probe. The position of RNA markers are indicated in kilobases (kb). 

Figure 21 is a comparison of the amino acid sequence of CaiLf II with homologus 
proteins MUP 6 Mouse and Rat A2U. The alignment was made with GeneWorks program. 
The signal sequences are underlined. Amino acid residues which arc identical in all three 
proteins are boxed. 
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Figure 22A-C are graphic representations of a direct binding assays of human IgE 
binding to native Canf II and recombinant £an_f II . 

Figure 23 is the nucleotide consensus sequence among cDNA clones la, lc and lj 

encoding partial or full length Caul II . 
5 Figure 24 is a graphic representation depicting the response of T cell lines from 12 

patients primed in vitro with recombinant Canil <£aaf I) and analyzed for response to 
rCani I and various peptides derived from Canf I by stimulation index. A stimulation index 
* equal to or greater than two times the background is considered "positive." 

Figure 25 is a graphic representation depicting the response of T cell lines from 12 
10 . patients primed in vitro with recombinant Can! I (r£sD_fI) and analyzed for response to 

' ,r£aiLf I and various peptides derived from Qml I by mean stimulation index for the group of 
.^patients with positive responses to the peptides. 
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plaited Desr rip^"" " f th « Invention 

This invention pertains to isolated nucleic acids encoding peptides having at least one 
biological activity of Canil or Caaf H, allergens of the species Canis familiar*. 
Preferably, the nucleic acid is a cDNA comprising a nucleotide sequence shown in Figure 5 
(SEQ ID NO: 1) (£an_fl) or Figure 18 (SEQ ID NO:67) (Qml II). 

The cDNA shown in Figure 5 (SEQ ID NO:l) encodes a Canf 1 peptide which 
20 includes a 26 amino acid leader sequence encoded by base 1 through base 78. This leader 
sequence is not found in the mature Canll protein, which is encoded by bases 79 through 
525. The deduced amino acid sequence of £anf I based on this cDNA is also shown in 
Figure 5 (SEQ ID NO: 2). The cDNA encodes a mature peptide having a predicted 
molecular weight of 19.2 kDa, with a pi of 5.53 and a single potential N-linked glycosylation 
site. A culture of E. coli transfected with an expression vector containing the cDNA 
encoding Can f I was deposited under the Budapest Treaty with the American Type Culture 
Collection on December 22, 1992 and assigned accession number 69167. 

The cDNA shown in Figure 18 (SEQ ID NO: 67) encodes a Canf II peptide which 
includes a 19 amino acid leader sequence encoded by base 195 through base 251. This leader 
30 sequence is not found in the mature £anf n protein, which is encoded by bases 252 through 
734. The deduced amino acid sequence of Canf II based on this cDNA is shown in Figure 
1 8 (SEQ ID NO: 68). The cDNA encodes a Caai II peptide having a predicted molecular 
weight of 1 8.229 kDa, with a pi of 4.54 for a rnature recombinant Canf H protein and a pi of 
4 44 for a full length (including singal sequence), recombinant Caof U protein, and a single 
35 potential N-linked glycosylation site. N-linked glycosylation may increase the molecular 
weight of the peptide and may alter the pi of the mature protein. A culture of E. coli 
transfected with an expression vector containing the cDNA encoding Qml II was deposited 
under the Budapest Treaty with the American Type Culture Collection on December 29, 
1993 and assigned accession number XXXX. 
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Accordingly, one aspect of this invention pertains to isolated nucleic acids comprising 
nucleotide sequences encoding Can f I or £an_f II, fragments thereof encoding peptides 
having at least one biological activity of £an_f I or £§nf II, and/or equivalents of such 
nucleic acids. The term nucleic acid as used herein is intended to include such fragments and 
5 equivalents. The term equivalent is intended to include nucleotide sequences encoding 

functionally equivalent Caof I or £an_f U proteins or functionally equivalent peptides having 
- an activity of Can f I or Canf H. As defined herein, a peptide having an activity of Cauf I or 
£an_f II has at least one biological activity of the £an_f I or QsaS II allergen. Equivalent 
nucleotide sequences will include sequences that differ by one or more nucleotide 
1 6 substitutions, additions or deletions, such as allelic variants, and will also include sequences 
that differ from the nucleotide sequence encoding Canf I or Can f II shown in Figure 5 (SEQ 
ID NO: 1 ) or Figure 1 8 (SEQ ID NO: 67) due to the degeneracy of the genetic code. 
Equivalents will also include nucleotide sequences that hybridize under stringent conditions 
(i.e., equivalent to about 20-27°C below melting temperature (T m ) and about 1M salt) to the 
15 nucleotide sequence of Can f I shown in Figure 5 (SEQ ID NO: 1) or £aaf II shown in 
Figure 18 (SEQ ID NO: 67). 

Peptides referred to herein as having an activity of £an_f I or Qaol II or having a Can 
f I or Canfll activity are defined herein as peptides that have an amino acid sequence 
corresponding to all or a portion of the amino acid sequence of £an_f I or£an_f n shown in 
20 Fiugre 5 (SEQ ID NO: 2) or Figure 1 8 (SEQ ID NO: 68) which peptide has at least one 

biological activity of Can f I or £an_f II. For example, a peptide having an activity of £an_f I 
or Can f II may have the ability to induce a response in £an_f I or £an_f II restricted T cells 
such as stimulation (e.g., T cell proliferation or cytokine secretion) or to induce T cell non- 
responsiveness. Alternatively, or additionally, a peptide having an activity of Caul I or Can 
25 f II may have the ability to bind (to be recognized by) immunoglobulin E (IgE) antibodies of 
dog dander-allergic subjects. Peptides which bind IgE are useful in methods of detecting 
allergic sensitivity to Canf l or Can f II in a subject. Peptides that do not bind IgE, or bind 
IgE to a lesser extent than a purified, native £an_f I or Cml II protein binds IgE are 
particularly useful as therapeutic agents. 
30 In one embodiment, the nucleic acid is a cDNA encoding a peptide having an activity 

of £arj_f I or Canf II. Preferably, the nucleic acid is a cDNA molecule comprising at least a 
portion of the nucleotide sequence encoding Can f l or £ajj_f II, shown in Figure 5 (SEQ ID 
NO:l) and Figure 18 (SEQ ID NO: 67). A preferred portion of the cDNA molecules of 
Figure 5 and Figure 18 includes the coding region of the molecule. 
35 In another embodiment, the nucleic acid of the invention encodes a peptide having an 

activity of £an_f I or Can f II and comprising an amino acid sequence shown in Figure 5 
(SEQ ID NO:2) (Can f I) or Figure 1 8 (SEQ ID NO: 68) (Canf II). Preferred nucleic acids 
encode a peptide having a£arj_f I or Can f II activity and having at least about 50% 
homology, more preferably at least about 60% homology and most preferably at least about 
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70% ho.no.ogy w«h .he sequence shown in Figure 5 (SEQ ID NO: 1) 

,SEO ID NO 67) (CmXID. Nucleic acids which encode pept.de* havmg a CaLfl or £an_f 

usably a. leas, about 98-99% homology with a sequence se, forth ,n F.gure 5 (SEQ D 
, hJJSxD orFigure .8 (SEQ IDNO: 68) M) - also witott-e scope of to 
v^o?Homo.ogy «fers to sequence simflanty between two peptides having an acyty 
^.oSnorhe.weon^nuc.eicacidn.o.ecu.es. Homology can be «1 

S- b *e compared sequence is occupied by the same base or annuo actd, then 

20 IS x SSC a. 50-C to a high stringency of about 0.2 x SSC a. 50-C In addmon, the 
2 SI in me wash step can be increased from .ow stringency , c— a, room 

lem^ture, about 22'C,^ ^ 

« from me sequences of Figure 5 and Figure 1 8 due to degeneracy > the g net,c 

30 Codons that specify the same aminoacid, or synonyms (for example, CAU and CAC are 
S^^lJL) may result in "silent" mutations which do not affect me ammo actd 

« ^In me dog dander population. One skilled in the art will apprectaK that the* 
35 ^Tule o more nucleotides (up to about 3^% of the nucleotides) 

one or more isoforms or related, cross-reacting fam.ly members ofCaof IorCaD-fll- 
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isoforms or family members are defined as proteins related in function and amino acid 
sequence to CajLf I or CsjLf II, but encoded by genes at different loci. 

Fragments of the nucleic acid encoding Can! I or Canf II are also within the scope of 
the invention. As used herein, a fragment of the nucleic acid encoding Caul I or Canf U 
refers to a nucleotide sequence having fewer nucleotides than the nucleotide sequence 
encoding the entire amino acid sequence of Canf I or Canf H protein and which encodes a 
peptide having an activity of Canf I or Canf II (i.e., a peptide haying at least one biological 
activity of the Canf I or £an_f II allergen) as defined herein. 

Preferred nucleic acid fragments encode peptides of at least about 10 ammo acid 
residues in length, preferably about 1 0-20 amino acid residues in length, and more preferably 
about 12-16 amino acid residues in length. Nucleic acid fragments which encode peptides 
having a£aiLf I activity of at least about 30 amino acid residues in length, at least about 40 
amino acid residues in length, at least about 60 amino acid residues in length, at least about 
80 amino acid residues in length, at least about 1 00 amino acid residues in length, and at least 
about 140 residues in length or more are also within the scope of this invention. Nucleic acid 
fragments which encode peptides having a Canf H activity of at least about 30 ammo acid 
residues in length, at least about 40 amino acid residues in length, at least about 60 ammo 
acid residues in length, at least about 80 amino acid residues in length, at least about 100 
amino acid residues in length, at least about 140 residues in length, and at least about 160 
20 amino acid residues in length or more are also within the scope of this invention. 

Nucleic acid fragments within the scope of the invention include those capable of 
hybridizing under high or low stringency conditions with nucleic acids from other animal 
species for use in screening protocols to detect Canf I or Canf II or allergens that are cross- 
reactive with Canf I or Canf H. Generally, the nucleic acid encoding a peptide having an 
activity of Canf 1 or Canf II will be selected from the bases encoding the mature protein, 
however in some instances it maybe desirable to select all or part of a peptide from the 
leader sequence portion of the nucleic acids of the invention. Nucleic acids withm the scope 
of the invention may also contain linker sequences, modified restriction endonuclease sites 
and other sequences useful for molecular cloning, expression or purification of recombinant 
30 peptides having an activity of Canf I or Canf II- 

A nucleic acid encoding a peptide having an activity of Canf I or Canf II may be 
obtained from mRNA present in salivary glands or other organs of the pet dog Canis 
familiaris. It should also be possible to obtain nucleic acids encoding Canll or Canf H 
from Canis familiaris genomic DNA. For example, the gene encoding Canf I or Canf II can 
35 be cloned from either a cDNA or a genomic library in accordance with protocols herein 
described. A cDNA encoding Canf 1 or Canf II can be obtained by isolating total mRNA 
from Canis familiaris. Double stranded cDNAs can then be prepared from the total mRNA. 
Subsequently, the cDNAs can be inserted into a suitable plasmid or bacteriophage vector 
using any one of a number of known techniques. Genes encoding Canf I or Canf II can also 
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be cloned using established polymerase chain reaction techniques in accordance with the 
nucleotide sequence information provided by the invention. The nucleic acids of the 
invention can be DNA or RNA. A preferred nucleic acid is a cDNA encoding Qml I or Can 
f II having the sequence depicted in Figure 5 (SEQ ID NO:l) (£an_f I ) or Figure 18 (SEQ ID 

5 NO:67)(£anfII). 

This invention also provides expression vectors containing a nucleic acid encoding a 
peptide having an activity of £an_f I or £aD_f H, operably linked to at least one regulatory 
sequence. Operably linked is intended to mean that the nucleotide sequence is linked to a 
regulatory sequence in a manner which allows expression of the nucleotide sequence. 
1 0 /Regulatory sequences are art-recognized and are selected to direct expression of the peptide 
vhaving an activity of Caaf lor £ml II. Accordingly, the term regulatory sequence includes 
promoters, enhancers and other expression control elements, Such regulatory , sequences are 
described in Goeddel; Hf "» Fvpression Technology; Methods in FnzvmologY 185 , 
Academic Press, San Diego, CA (1990). It should be understood that the design of the 
1 5 expression vector may depend on such factors as the choice of the host cell to be transformed 
and/or the type of protein desired to be expressed. In one embodiment, the expression vector 
includes a DNA encoding a peptide having an activity of CmS I or £attf EL Such expression 
vectors can be used to transfect cells to thereby produce proteins or peptides, including fusion 
proteins or peptides encoded by nucleic acids as described herein. 
20 This invention further pertains to a host cell transfected to express a peptide having an 

activity of Canf I or Canf II. The host cell may be any procaryotic or eucaryotic cell. For 
example, a peptide having an activity of Canf I or Canf II may be expressed in bacterial 
cells such as E. coli, insect cells (baculovirus), yeast, or mammalian cells such as Chinese 
hamster ovary cells (CHO). Other suitable host cells can be found in Goeddel, (1990) supra 
25 ,or known to those skilled in the art. 

Expression in eucaryotic cells such as mammalian, yeast, or insect cells can lead to 
partial or complete glycosylation and/or formation of relevant inter- or intra-chain disulfide 
bonds of recombinant protein. Examples of vectors for expression in yeast S. cerivisae 
include pYepSecl (Baldari. gLaJL, (1987) EffibjiL, £: 229 - 234), pMFa (Kurjan and 
30 Herskowitz, (1982) Cell, 30: 933 - 943), pJRY88 (Schultz sLaL (1987) £kB£, 54: 1 13 - 123), 
and pYES2 (Invitrogen Corporation, San Diego, CA). Baculovirus vectors available for 
expression of proteins in cultured insect cells (SF 9 cells) include the pAc series (Smith SLaL, 
(1983) Mol. CelLEifiL, 3_: 2156 - 2165) and the pVLieries (Lucklow, V.A., and Summers, 
M.D., (1989) Virology . 170: 31 - 39). Generally COS cells (Gluzman, Y., (1981) £§11, 21: 
35 175 - 1 82) are used in conjunction with such vectors as pCDM 8 (Aruffo, A. and Seed, B., 
(1987) Pmr. Natl. AcaH Sci. USA . M: 8573 - 8577) for transient amplification/expression in 
mammalian cells, while CHO (dhfr- Chinese Hamster Ovary) cells are used with vectors such 
as pMT2PC (Kaufman £UL, (1 987) EMBUL, 6: 1 87 - 195) for stable 
amplification/expression in mammalian cells. Vector DNA can be introduced into 
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mammalian cells via conventional techniques such as calcium phosphate or calcium chloride 
co-precipitation, DEAE-dextran-mediated transfection, or electroporation. Suitable methods 
for transforming host cells can be found in Sambrook sLil, ( Molecular Cloning : A 
T ^Tsitorv Manual . 2nd Edition, Cold Spring Harbor Laboratory press (1989)), and other 

5 laboratory textbooks. 

Expression in procaryotes is most often carried out in E. coli with either fusion or 
• non-fusion inducible expression vectors. Fusion vectors usually add a number of NH2 
terminal amino acids to the expressed target gene. These NH2 terminal amino acids often are 
referred to as a reporter group. Such reporter groups usually serve two purposes: 1) to 
10 increase the solubility of the target recombinant protein; and 2) to aid in the purification of 
the target recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
' : : expression vectors, a proteolytic cleavage site is introduced at the junction of the reporter 
group and the target recombinant protein to enable separation of the target recombinant 
protein from the reporter group subsequent to purification of the fusion protein. Such 
15 enzymes, and their cognate recognition sequences, include Factor Xa, thrombin and 

enterokinase. Typical fusion expression vectors include pGEX (Amrad Corp., Melbourne, 
Australia), pMAL (New England Biolabs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, 
NJ) which fuse glutathione S-transferase, maltose E binding protein, or protein A, 
respectively, to the target recombinant protein. 
20 Inducible non-fusion expression vectors include pTrc (Amann fiLaL, (1 988) Gene, 62: 

301 - 315) and pET lid (Studier et al. . fipne Expression Technology: Methods in 
Rnrvmologv . 185. Academic Press, San Diego, California (1990) 60-89). While target gene 
expression relies on host RNA polymerase transcription from the hybrid trp-lac fusion 
promoter in pTrc, expression of target genes inserted into pET 1 Id relies on transcription 
25 from the T7 gnl 0-lac 0 fusion promoter mediated by coexpressed viral RNA polymerase (T7 
gnl). This viral polymerase is supplied by host strains BL21(DE3) or HMS174(DE3) from a 
resident X prophage harboring a T7 gnl under the transcriptional control of the lacUV 5 
promoter. 

One strategy to maximize recombinant £an_f I or £an_f II expression in £. coli is to 
30 express the protein in a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein (Gottesman, S., Gene Expression Technology: Methods in 
EnT^mology . 185. Academic Press, San Diego, California (1990) 1 19-128). Another strategy 
would be to alter the nucleic acid encoding the Can t I or £an_f II protein to be inserted into 
an expression vector so that the individual codons for each amino acid would be those 
35 preferentially utilized in highly expressed E. coli proteins (Wada eUL, (1 992) Nup. Acids 

R es 20: 21 1 1 - 21 1 8). Such alteration of nucleic acids of the invention can be carried out by 
standard DNA synthesis techniques. 

The nucleic acids of the invention can also be chemically synthesized using standard 
techniques. Various methods of chemically synthesizing polydeoxynucleotides are known. 
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including solid-phase synthesis which, like peptide synthesis, has been folly 
LJi-ly available DNA synthesizers (S* e.g., Uakura ^L, U.S. ^^,598,049, 
Caruthers sUL, U.S. PatentNo. 4,458,066; and hakura, U.S. Patent Nos. 4,401,796 and 
4 373 07 1 , incorporated by reference herein). 
, ' ' The present invention toner pertains to methods of producing peptides that have an 

activity of fcafl -flafB- 

directing expression of a nucleotide sequence encoding a peptide havtng an activity of £jafl 
• or Cmf n can be cultured under appropriate conditions to allow expression of the peptide to 
occur The peptide may be secreted and isolated from a mixture of cells and medium 
,0 .WtaWngmepepddehavmganaetivityofC^lorC^II. Ata-lW*,*^. 

-ceHeulmre includes host cells, media and „merbypred«c R ..Sui tt ble media for cellculnire 
are well known in the art. The peptide having an activity of CmUl or Caof fl can be 
isolated from cell culture medium, host cells, or both using techniques known m the art for 
,5. purifyingproteirctacmmiigion^^ 
uljut^on.elecwph^^ 

peptidehavinganactivityofCaiLflorCaiLfll. . 

Another aspect of the invention pertains to isolated peptides havmg an actwityofCan 
fIor£an_fII. A peptide having an activity of Calf I or Caai II has at leas, one biological 
20 activity oftheCaofl or Cml n allergen. For example, a peptide havng an acuity of Cani 
I ot CfflLf II may have the abUity to induce a response in Cauf I or Can! II specific T cells 
such as stimulation <T ceU proliferation or cytokine secretion) or to induce T cell non- 
responsiveness. Inoneembc^UapeptidetovmganactivivofCmflorCanin 
stimulates T cells as evidenced by, for example. T cell proliferation or cytokine secretion. In 
25 ,. - another embodiment, peptides having a Caof 1 orM activity induce T cell non_ 

, responsiveness in which T cells are unresponsive to a subsequent challenge with a Canil or 
Caafll peptide following exposure to the peptide. In yet another embodiment, a peptide 
having a Canf 1 or CaB_f II activity has reduced IgE binding activity compared to purified, 
native CaaflorCaoill protein. A peptide having an activity of Calf I or CanJII may 

30 differ m ammo acid sequence tram me Calf l o^ 

( S EQIDNO:2)(CanJI)orFigurel8(SEQroNO : 68){f^I0bmsuchdifferencesre^ 

in a modified protein which functions in the same or similar manner as a native Cmf l or 
rwf n protein or which has the same or similar characteristics of a native Cmflor Csofll 
protein Various modifications of theCani I or Cauf II protein to produce these and other 
35 functionallyequivalentpeptidesaredescrib^inde^ilherein. The term peptide, as used 
herein, refers to peptides, proteins, and polypeptides. „ 
A peptide can be produced by modification of the amino acid sequence of the Cani 1 
or Canill protein shown in Figure 5 (SEQIDNO: 2) (Calf I) or Figure 18 (SEQ ID NO: 
68) (CanJID, such as a substitution, addition, or deletion of an amino acid resume which ,s 
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not directly involved in the function of the protein. Peptides of the invention can be at least 
about 10 amino acid residues in length, preferably about 10-20 amino acid residues in length, 
and more preferably about 10-16 amino acid residues in length. Peptides having an activity 
of Can f I or Can f II and which are at least about 30 amino acid residues in length, at least 
5 about 40 amino acid residues in length, at least about 60 amino acid residues in length, at 
least about 80 amino acid residues in length, and at least about 100 amino acid residues in 
length are also included within the scope of this invention. 

Another embodiment of the invention provides a substantially pure preparation of a 
peptide having an activity of Can f I or£anIH. Such a preparation is substantially free of 

1 0 proteins and peptides with which the peptide naturally occurs (i.e., other canine peptides), 
either in a cell or when secreted by a cell. 

The term isolated as used herein refers to a nucleic acid or peptide that is substantially 
free of cellular material or culture medium when produced by recombinant DNA techniques, 
or chemical precursors or other chemicals when chemically synthesized. Such proteins or 

15 peptides are also characterized as being free of all other dog dander proteins. Accordingly, 
an isolated peptide having an activity of £an_f I or £ani II is produced recombinantly or 
synthetically and is substantially free of cellular material and culture medium or substantially 
free of chemical precursors or other chemicals and is substantially free of all other dog 
proteins. An isolated nucleic acid is also free of sequences which naturally flank the nucleic 

20 acid (i.e., sequences located at the 5' and 3* ends of the nucleic acid) in the organism from 
which the nucleic acid is derived. 

Peptides having an activity of Can-f I or Canf II can be obtained, for example, by 
screening peptides recombinantly produced from the corresponding fragment of the nucleic 
acid of Canfl or CanJTH encoding such peptides. In addition, fragments can be chemically 

25 synthesized using techniques known in the art such as conventional Merrifield solid phase f- 
Moc or t-Boc chemistry. For example, the Qsul I or £anf H protein may be arbitrarily 
divided into fragments of desired length with no overlap of the fragments, or preferably 
divided into overlapping fragments of a desired length. The fragments can be produced 
(recombinantly or by chemical synthesis) and tested to identify those peptides having a Qml 

30 I or£an_f II activity (i.e., the ability to induce a T cell response such as stimulation 
(proliferation, cytokine secretion), nonresponsiveness, and/or has reduced IgE binding 
activity). 

In one embodiment, peptides having an activity of Canf I or £aaiTI can be identified 
by the ability of the peptide to stimulate T cells or to induce T cell non-responsiveness. 
35 Peptides which stimulate T cells, as determined by, for example, T cell proliferation or 
cytokine secretion are defined herein as comprising at least one T cell epitope. T cell 
epitopes are believed to be involved in initiation and perpetuation of the immune response to 
the protein allergen which is responsible for the clinical symptoms of allergy. These T cell 
epitopes are thought to trigger early events at the level of the T helper cell by binding to an 
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appropriate HLA molecule on the surface of an antigen presenting cell, thereby stimulating 
the T cell subpopulation with the relevant T cell receptor for the epitope. These events lead 
to T cell proliferation, lymphokine secretion, local inflammatory reactions, leertntment of 
additional immune cells to the site of antigen^ cell interaction, and activation of the B cell 

5 cascade leading to the production of antibodies. One isotype of these antibodies, IgE, is 
fundamentally important to the development of allergic symptoms and its production is ^ 
influenced early in the cascade of events at the level of the T helper cell, by the nature of the 
* lymphokines secreted. AT cell epitope is the basic element, or smallest unit of recognition 
by a T cell receptor, where the epitope comprises amino acids essential to receptor . 

10 ignition. Aniino acid sequences which 

modifythealle^ p 

Screening peptides for those which retain a Oaf I or £aif II activity as described 
hereinc^beaccomplbhedmingoneormoreofseverddifferentassays. For example, m 
vitro CaiLfl or CanfHT cell stimulatory activity is assayed by contacting a peptide known 

15 orsu^ctedofhavmgaQ^^ 

presentsappropriateMHCmolecule^inaTcellculture. Presentation of a peptide having a 

Canf I or £anf II activity in association with appropriate MHC molecules to T cells m 
conjunction with the necessary costimulationhas the effect of trar^mitung a signal totheT 
cell that induces the production of increased levels of cytokines, particularly of mterleukin-2 
20 andinterleukin-4. The culture supernatant can be obtained and assayed for interieukm-2 or 
other known cytokine, For example, any one of several conventional W^****"* 
2 can be employed, such as the assay described in Pror Nfitl Acad . Sci USA , 8fr 1333 
(1 989) the pertinent portions of which are incorporated herein by reference. A kit for an 
assay for the production of interferon is also available from Genzyme Corporation 

25 - (Cambridge, MA). _ . ... 

Alternatively, a common assay for T cell proliferation entails measuring tritiated 
thymidine incorporation. The proliferation ofT cells can be measured in vitro by 
detennining the amount of 3 H -labeled thymidine incorporated into the replicating DN A of 
cultured cells. Therefore, the rate ofDNA synthesis and, in turn, the rate of cell division can 

30 be quantified. ^ ^ ^ , w T cell stimulating activity 

(i e the peptide comprises at least one T cell epitope) can be identified using an algorithm 
which predicts the presence of T cell epitopes in a protein sequence, such as the algonthm 
described by Hill et al., Journal of ImmvnolQRY* 14Z:189-197 (1991). The algonthm of Hill 
35 et al predicts the location of T cell epitopes in a protein by the presence of certain patterns 
within the sequence which are likely to bind MHC and therefore may contain T cell epitopes. 
Based on the Hill et al. algorithm, two 13 amino acid peptides (discussed in Example 10 
have been identified and produced synthetically. Such peptides were tested for T cell ac^ty 
as described above (e.g., by measuring cellular uptake of tritiated thymidine). Specifically, 
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in Example 10, human T cell stimulating activity was tested by culturing T cells obtained 
from an individual sensitive to CanJI (i.e., an individual who has an IgE mediated immune 
response to QsrI I) with a peptide derived from £an_f I and proliferation of T cells in 
response to the peptide was determined, e.g., by measuring cellular uptake of tritiated 
thymidine. Stimulation indices for responses by T cells to peptides were calculated as the 
maximum CPM in response to a peptide divided by the control CPM. A stimulation index 
(S.I.) equal to or greater than two times the background level was considered "positive". 
Positive results were used to calculate the mean stimulation index for each peptide for the 
group of patients tested (See Figure 25). Preferred peptides of this invention comprise at 
least one T cell epitope and have a mean T cell stimulation index of greater than or equal to 
2.0. A peptide having a mean T cell stimulation index of greater than or equal to 2.0 in a 
significant number of dog dander allergen sensitive patients tested is considered useful as a 
therapeutic agent Preferred peptides have a mean T cell stimulation index of at least 2.5, 
more preferably at least 3.0, more preferably at least 3.5, more preferably at least 4.0, more 
1 5 preferably at least 5.0 and most preferably at least about 6. For example, peptides having a 
Caaf I activity and having a mean T cell stimulation index of at least 5, as indicated by data 
shown in Figure 25, include Construct 1 (SEQ ID NO:105), Construct 2 (SEQ ID NO:106), 
and Construct 3 (SEQ ID NO.107). T cell epitopes can also be predicted and determined as 
described above for peptides derived from £an_fH. 
20 In addition, preferred peptides have a positivity index (P.I.) of at least about 60, more 

preferably about 100, more preferably at least about 200 and most preferably at least about 
300. The positivity index for a peptide is determined by multiplying the mean T cell 
stimulation index by the percent of individuals, in a population of individuals sensitive to dog 
dander allergens (e.g., preferably a population of at least 12 individuals, more preferably a 
25 population of at least 30 individuals or more), who have a T cell stimulation index to such 
peptide of at least 2.0. Thus, the positivity index represents both the strength of a T cell 
response to a peptide (S.I.) and the frequency of a T cell response to a peptide in a population 
of individuals sensitive to dog dander allergens. In Figure 10, the bar represents the 
positivity index and the percent of individuals tested who have a T cell stimulation index of 
30 at least 2.0 to various peptides derived from Can f I. For example, as shown in Figure 25, 
Peptide A0095 has a mean S.I. of 3.0 and 43% of positive responses in the group of 
individuals tested resulting in a positivity index of 129. 

In order to determine precise T cell epitopes by, for example, fine mapping 
techniques, a peptide having Canf l or Can f II T cell stimulating activity and thus 
35 comprising at least one T cell epitope as determined by T cell biology techniques is modified 
by addition or deletion of amino acid residues at either the amino or carboxy terminus of the 
peptide and tested to determine a change in T cell reactivity to the modified peptide. 
Following this technique, peptides are selected and produced recombinantly or synthetically. 
Peptides are selected based on various factors, including the strength of the T cell response to 
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the peptide (e.g., stimulation index), the frequency of the T cell response to the peptide in a 
population of individuals sensitive to dog dander allergens, and the potential cross-reac ivity 
of the peptide with other dog dander allergens. The physical and chemical properties of these 
selected peptides (e.g., solubility, stability) ate examined to determine whether the peptides 
5 are suitable for use in therapeutic compositions or whether the peptides require modification 
as described herein. The ability of the selected peptides or selected modified peptides to 
stimulate human T cells (e.g., induce proliferation, lymphpkine secretion) is then determined 

as described herein. j * 

In another embodiment, a peptide having aCanf I or Caaf II activity is screened for 

~ cells (as determined by one or more of the above described assays), to inhibit or completely 
^ block the activity of purified native £ajLf I or £an_f II or portion thereof and induce a state of 

non-responsiveness can be determined using subsequent attempts at stimulation of the T ceUs 

wimantigenpresentingce^^ 

15 or Ci^II activity fol^ If 
the T cells are unresponsive to the subsequent activation attempts, as determined by 
interleukin-2 synthesis and/or T cell proliferation, a state of non-responsiveness has been 
induced. Ses, e.g., Gimmi glaL (1993) Pmc Nlti Add . M USA, 2fl: 6586 ■ 6590; and 
Schwartz (1990) SfiiSDCS, 24& 1349 - 1356, for assay systems that can be used as the basis 
20 for an assay in accordance with the present invention. 

In yet another embodiment, peptides having aCanil or Caaf H activity are 
identified by IgE binding activity. For therapeutic purposes, peptides of the invention 
preferably do not bind IgE specific for a dog dander allergen, or bind such IgE to a 
substantially lesser extent than the corresponding purified native dog dander allergen binds 
25 such IgE. Reduced IgE binding activity refers to IgE binding activity that is less than that of 
purified r^tiveCaafl or Canin protein. If a peptide having a£an_f I or £an_f II activity is 
to beused as a diagnostic reagenUt is not necessary that the ^ peptide have reduced IgE 
binding activity compared to the native Qm£ I or Caaf II allergen. IgE binding activity of 
peptides can be determined by, for example, an enzyme-linked immunosorbent assay 
30 (ELISA) using, for example, sera obtained from a subject, (i.e., an allergic subject) that has 
been previously exposed to the native Can! I or Caaf II allergen. Briefly, the peptide 
suspected of having a Caaf I or Can! II activity is coated onto wells of a microtiter plate. 
After washing and blocking the wells, antibody solution consisting of the plasma of an 
allergic subject who has been exposed to a peptide suspected of having a £aai I or Cani II 
35 activity is incubated in the wells. The plasma is generally depleted of IgG before incubation. 
A labeled secondary antibody is added to the wells and incubated. The amount of IgE 
binding is then quantified and compared to the amount of IgE bound by a purified, native 
Canfl or CanJII protein. Alternatively, the IgE binding activity of a peptide can be 
determined by Western blot analysis. For example, a peptide suspected of having a£an_f I or 
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Can f II activity is run on a polyacrylamide gel using SDS-PAGE. The peptide is then 
transferred to nitrocellulose and subsequently incubated with sera from an allergic subject. 
After incubation with a labeled secondary antibody, the amount of IgE bound is then 
determined and quantified. 
5 Another assay which can be used to determine the IgE binding activity of a peptide is 

a competition ELISA assay. Briefly, an IgE antibody pool is generated by combining plasma 
from dog dander allergic subjects that have been shown by direct ELISA to have IgE reactive 
with native CaiLf I or Qml II. This pool is used in ELISA competition assays to compare 
IgE binding nf native Can f I or £aof II and a peptide suspected of having a Qsnl I or Qan£ 
10 II activity. IgE binding for the native £anl I or Qsnl II protein and a peptide suspected of 
having a Can f I or Can f II activity is determined and quantified. 

If a peptide having an activity of Qsnl I or Qsnl II binds IgE, and is to be used as a 
therapeutic agent, it is preferable that such binding does not result in the release of mediators 
(e.g., histamines) from mast cells or basophils. To determine whether a peptide which binds 
15 IgE results in the release of mediators, a histamine release assay can be performed using 
standard reagents and protocols obtained, for example, from Amac, Inc. (Westbrook, ME). 
Briefly, a buffered solution of a peptide suspected of having a Qsnl I or£aiLf II activity is 
combined with an equal volume of whole heparinized blood from an allergic subject. After 
mixing and incubation, the cells are pelleted and the supernatants are processed and analyzed 
20 using a radioimmunoassay to determine the amount of histamine released. 

Peptides having an activity of Can f I or Canl II which are to be used as therapeutic 
agents are preferably tested in mammalian models of dog dander atopy, such as the mouse 
model disclosed in Tamura et al.. (1986) Microbiol, Immunol,, 2Q: 883 - 896, or in U.S. 
Patent 4,939,239, or in the primate model disclosed in Chiba £LsL (1990) Tnt, Archt AlkrgY 
25 Immunol. . 21: 83 - 88. Initial screening for IgE binding to a peptide having an activity of 
Canf I or Canf II may be performed by scratch tests or intradermal skin tests on laboratory 
animals or human volunteers, or in in vitro systems such as RAST, RAST inhibition, ELISA 
assay, RIA (radioimmunoassay), or a histamine release assay, as described above. 

It is possible to modify the structure of a peptide having an activity of Qsnl I or Qanf 
30 II for such purposes as increasing solubility, enhancing therapeutic or prophylactic efficacy, 
or stability (e.g., shelf life ex vivo and resistance to proteolytic degradation in vivo). Such 
modified peptides are considered functional equivalents of peptides having an activity of Qsn 
f I or £an_f II as defined herein. A modified peptide can be produced in which the amino 
acid sequence has been altered, such as by amino acid substitution, deletion, or addition, to 
35 modify immunogenicity and/or reduce allergenicity, or to which a component has been added 
for the same purpose. 

For example, a peptide having an activity of Can f l or Canf II can be modified so 
that it maintains the ability to induce T cell non-responsiveness and bind MHC proteins 
without the ability to induce a strong proliferative response or possibly, any proliferative 
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response when administer*) in immunogenic torn. In this instance, critical binding residues 
^1. receptor functioncanbedeu^ 

"eh residue and determination of the presence or absence of T eel, «*n£ 
residues shown to be essential to interact with the T cell receptor can be modified by 

• vlLtcorpLonmay^ 

10 J b^^ ,„ u jiwk, 

* AoMMonaUy^denavtag^activityof^ 

-replacmg^ar^ 

, ^ preferably simUarardnoacidresidue^ 
I^oenhancidin^hbutnotdimina^ or not affect T cell activity. In addmon, arnmo 

incorporation may enhance, no, affe* or d^b« not el^ T cell rea^. 

to substitutions vrift alanine, glutamic acid, or a methyl ammoacid. 

Another example of modification of a peptide having an actmty of Cjaf 1 or Caai H 
is substitution of cysteine residues ^Hy^^,^^^"'*™ 0 ' . 

acid side chains of fragments of the protein of the invention can be chemi^ly modified. 
Another modification is cyclization of the peptide. 
5 , lnor dertoenhancestabilityano7orr^^ 

or Caol-n can be modified to incorporate one or more polymorphisms in the amino acid 
^Tce cf the protein allergen renting from any natural al.eiic variation. Addmonally^- 
H 0 acids, noLntral amino acids, or non-amino acid analogs can b, substittfed or 

,0 tavinganactivity ofCmil orCmf II can be modified using polyethylene glycol (PEG) 

conjugated with PEG. In addition, PEG can be added during chemical synthesis ^® 
proLm. Othermodifications of a peptide having an «tivity of CmflorC^f II include 
reduction/alkylation CTarr, of Protnn Mif rfKh^erizalion, 3. E. Silver ed., 

35 Humana Press, Clifton NJ 155-194 (1986))-. acylation (T*r, art «1 coup ing toan 
appropriate carrier (Mishell and Shiigi, eds, 1 li rN M"tW to r f lll#r linmmlogy, WH 
Freeman, San Francisco, CA (1980), U.S. Patent 4,939,239; or mild formalin treatment 
(Marsh, (1971) i n ) Arrh of Allergy imfl ftppl Immunol , 41; 1» - 215). 
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To facilitate purification and potentially increase solubility of a peptide having an 
activity of Can f I or Can f II. it is possible to add an amino acid fusion moiety to the peptide 
backbone. For example, hexa-histidine can be added to the protein for purification by 
immobilized metal ion affinity chromatography (Hochuli, E. fiLaL, (1988) Bio/Technology, 
5 <£: 1321 - 1325). In addition, to facilitate isolation of peptides free of irrelevant sequences, 
specific endoprotease cleavage sites can be introduced between the sequences of the fusion 
moiety and the peptide. In order to successfully desensitize a subject to QmS I or CanJf n 
* protein or related allergen, it may be necessary to increase the solubility of the protein by 
adding functional groups to the protein, or by omitting hydrophobic regions of the protein. 
10 ~ To potentially aid proper antigen processing of T cell epitopes within £an_f I or Can f 
II, canonical protease sensitive sites can be engineered between regions, each comprising at 
- - least one T cell epitope via recombinant or synthetic methods. For example, charged amino 
acid pairs, such asKKor RR, can be introduced between regions within a protein or fragment 
during recombinant construction thereof. The resulting peptide can be rendered sensitive to 
1 5 cleavage by cathepsin and/or other trypsin-like enzymes which would generate portions of 
the protein containing one or more T cell epitopes. In addition, such charged amino acid 
residues can result in an increase in the solubility of the peptide. 

Site-directed mutagenesis of a nucleic acid encoding a peptide having an activity of 
Can f I or£an_f II can be used to modify the structure of the peptide by methods known in 
20 the art. Such methods may, among others, include polymerase chain reaction (PCR) with 
oligonucleotide primers bearing one or more mutations (Ho et al. . (1 989) Gene, 11: 5 1 - 59) 
or total synthesis of mutated genes (Hostomsky, Z. £LaL,(1989) Bioghem. Biophys. Res, 
Cornm, 161 : 1056 - 1063). To enhance recombinant protein expression, the aforementioned 
methods can be applied to change the codons present in the cDNA sequence of the invention 
25 to those preferentially utilized by the host cell in which the recombinant protein is being 
expressed (Wada gUl„ supra). 

Another aspect of the invention pertains to an antibody specifically reactive with a 
peptide having an activity of Canf I or Canf IL The antibodies of this invention can be used 
to standardize allergen extracts or to isolate the naturally-occurring or native form of Canf I 
30 or Canf II. For example, by using peptides having an activity of Canf I or Qmf II based on 
the cDNA sequence of Canf I or Can f IL anti-protein/anti-peptide antisera or monoclonal 
antibodies can be made using standard methods. A mammal such as a mouse, a hamster or 
rabbit can be immunized with an immunogenic form of the peptide (e.g., Csn.f I or Can f II 
protein or an antigenic fragment which is capable of eliciting an antibody response). 
35 Techniques for conferring immunogenicity on a protein or peptide include conjugation to 

carriers or other techniques well known in the art. A peptide having an activity of Canf I or 
Can f II can be administered in the presence of adjuvant. The progress of immunization can 
be monitored by detection of antibody titers in plasma or scrum. Standard ELISA or other 
immunoassay can be used with the immunogen as antigen to assess the levels of antibodies. 
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Follow taction, anti-Gml 1 or anti-Caof II antisera can be obtained and, if 
desired polyclonal anti-£an_f I or anti-CjSILf II antibodies isolated fiom the S ^ U " L T ° 
X monoclonal antibodies, antibody producing ceUs (.ymphocytes) can be harvested 
from an immunized anima. and fused by standard somatic cell fuston procures w«h 
5 Sizing cells such as myeloma cells to yield hybridoma cells Such technics « 

' cellhytm*>ma.e«hm q ue(Kozbar^a^^^^ 
^riZatecranquetoprodu^ 

u"u^r^ ' 
^antibody as used herein is intended to include fragments «=r«>wh.ch are 

„kn specifically reactive with the pepudehaving an activity of QuLflorCaofll. 

Xinmesamen^asdescribedabovefcrv^oleantibodteFor^ 
Clcan^generaudbyu^g^bodywiu.pepsu, ^^^^ 
entreated to .educe disulfide bridges to produce Fab' fragments. The antibody of me 
^tt^tion is further intended to include bispecific and chimeric moiecu.es havtng an 

specmcauy eenetically identical to one another and expressing identical T 

25 . *~» stimulation withaCaaf I orEmXII protein or peptide having an a«iv«y 

Vit ^Z^^ W i^^V^^<>^^ Alternatively ^Qi lor 
S^Lfic T-T hybridomas can be produced by a technique similar to B 
30 Potion. ^.^..^^-.^hta^^.P^"-"* 

an^nX of Canil or aafn, T cells are then purified and fused with an autonomously 
"Tceu^norUne. From tiie resulting hybridomas, ce,ls respouduig to a peptide 
teving an activity of Cmf 1 or CaaiH are selected and cloned. Procedures from 
gating monoclonal T cell populous are described in 
35 ZllJ* (Abul K. Abbas sUL ed.), W.B. Saunders Company, Philadelplua, PA (1991) 

T eel, receptors specifically reactive with a peptide having an activn, of 
rL , or CmW can be obtained by hnmunoprecipitation using an antibody again* tiie 
^ .o^described mln,— : (Second Edition) Edward S. Golub 

ei^L, ed., Sinauer Associates, Inc., Sunderland, MA (1991) pages 366-269. 
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T cell clones specifically reactive with a peptide having an activity of Can f I or Can f 
II can be used to isolate and molecularly clone the gene encoding the relevant T cell receptor. 
In addition, a soluble T cell receptor specifically reactive with a peptide having an activity of 
Can f l or Canf H can be used to interfere with or inhibit antigen-dependent activation of the 

5 relevant T cell subpopulation, for example, by aanunistration to an individual sensitive to 
Can f l or Caaf II. Antibodies specifically reactive with such a T cell receptor can be 
produced according to the techniques described herein. Such antibodies can be used to block 
or interfere with the T cell interaction with peptides presented by MHC. 

Exposure of allergic subjects to peptides having an activity of CanJI or Can f II and 

10 which have T cell stimulating activity, may cause the appropriate T ceU subpopulations to 
become non-responsive to the respective protein allergen (e.g., fail to stimulate an immune 
response upon such exposure). In addition, such administration may modify the lymphokine 
secretion profile as compared with exposure to the naturally-occurring protein allergen or 
portion thereof (e.g., result in a decrease of IL-4 and/or an increase in IL-2). Furthermore, 

15 exposure to peptides Having an activity of£an_f I or CaD_f II which have T cell stimulating 
activity may influence T cell subpopulations which normally participate in the response to 
the allergen such that these T cells are drawn away from the site(s) of normal exposure to the 
allergen (e.g., nasal mucosa, skin, and lung) towards the site(s) of therapeutic administration 
of the protein or fragment derived therefrom. This redistribution of T cell subpopulations 

20 may ameliorate or reduce the ability of an individual's immune system to stimulate the usual 
immune response at the site of normal exposure to the allergen, resulting in a diminution in 
allergic symptoms. 

A peptide having an activity of Can f I or Can_f II when administered to a subject 
sensitive to dog dander allergens is capable of modifying the B cell response, T cell response, 

25 ~ or both the B cell and the T cell response of the subject to the allergen. As used herein, 

modification of the allergic response of a subject to a dog dander allergen can be defined as 
non-respohsiveness or diminution in symptoms to the allergen, as determined by standard 
clinical procedures (See e.g., Vamey eJUL, (1990) British Medical Journal, 202: 265 - 269), 
including diminution in dog dander induced asthmatic symptoms. As referred to herein, a 

30 diminution in symptoms includes any reduction in the allergic response of a subject to the 
allergen following a treatment regimen with a peptide of the invention. This diminution in 
symptoms may be determined subjectively (e.g., the patient feels more comfortable upon 
exposure to the allergen), or clinically, such as witha standard skin test 

Peptides or antibodies of the present invention can also be used for detecting and 

3 5 diagnosing sensitivity to Canf I or Can f II. For example, this could be done by combining 
blood or blood products obtained from a subject to be assessed for sensitivity with peptide 
having an activity of Caaf I or Can! II, under conditions appropriate for binding of 
components in the blood (e.g., antibodies, T cells, B cells) with the peptide(s) and 
determining the extent to which such binding occurs. Other diagnostic methods for allergic 
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diseases which the peptides or antibodies of the present invention can be used include radio- 
allergosorbent test (RAST), paper radioimmunosorbent test (PRIST), enzyme linked 
immunosorbent assay (ELISA), radioimmunoassays (RIA), immuno-radiometnc assays 
(IRMA), luminescence immunoassays (LIA), histamine release assays and IgE immunoblots. 
5 The present invention further provides methods of detecting and treating sensitivity in 

a subject to Cani I or CanJII. The presence in subjects of IgE specific for Qm£ I or £anl 
II and the ability of T cells of the subjects to respond to T cell epitopes of Cani l or £anl II 
" can be determined by administering to the subject an Immediate Type Hypersensitivity test 
and/or a Delayed Type Hypersensitivity test (SfiS e.g., Trnrnnr i ologv (1985) Roitt, I.M., 
10 Bfostoff, J., Male, D.K. (eds), C.V. Mosby Co., Gower Medical Publishing, London, ^ 
192-19 18- pp 22.1-22.10) utilizing a peptide having an activity of Canil or Can! H, or a 
-* modified form of a peptide having an activity of £atf l or CllD^ 

specific for the allergen. The same subjects are administered a Delayed Type 
Hypersensitivity test prior to, simultaneously with, or subsequent to adnunistration of the 
15 Immediate Type Hypersensitivity test Of course, if the Immediate Type Hypersensitivity 
test is aoimmstered prior to the Delayed Type Hypersensitivity test, the Delayed Type 
Hypersensitivity test would be given to those subjects exhibiting a specific Immediate Type 
Hypersensitivity reaction. The Delayed Type Hypersensitivity test utilizes a peptide having 
an activity of Caaf I or Caul II which has human T cell stimulating activity and which does 
20 not bind IgE specific for the allergen in a substantial percentage of the population of subjects 
sensitive to the allergen (e.g., at least about 75%). Those subjects found to have both a 
specific Immediate type Hypersensitivity reaction and a specific Delayed Type 
Hypersensitivity reaction are administered an amount of a composition suitable for 
pharmaceutical administration. The composition comprises the peptide having an activity of 

25 . ;s .CjffiJIorCiir^ 

acceptable carrier or diluent. 

A peptide having an activity of Qsnll or CjiLfll can be used in methods of 
diagnosing, treating, and preventing allergic reactions to a dog dander allergen or a cross- 
reactive protein allergen. Thus, the present invention provides compositions suitable for 
30 pharmaceutical administration comprising an amount of at least one peptide having an 

activity of Caaf I or Caai II and a pharmaceutical^ acceptable carrier. Administration of 
the compositions of the present invention to a subject to be desensitized can be carried out 
using known procedures, at dosages and for periods of time effective to reduce sensitivity 
(i.e., reduce the allergic response) of the subject to a dog dander allergen. The term subject is 
35 intended to include living organisms in which an immune response can be elicited, e.g., 
mammals. Examples of subjects include humans, dogs, cats, mice, rats, and transgenic 
species thereof. An amount of at least one peptide having an activity of£an_f I or Canf II 
necessary to achieve a therapeutic effect may vary according to factors such as the degree of 
sensitivity of the subject to dog dander, the age, sex, and weight of the subject, and the ability 
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of a peptide having an activity of Canf I or £an_f II to elicit an antigenic response in the 
subject Dosage regima may be adjusted to provide the optimum therapeutic response. For 
example, several divided doses may be administered daily or the dose may be proportionally 
reduced as indicated by the exigencies of the therapeutic situation. 
5 The active compound (i.e., a peptide having an activity of Can f I or Canf II) may be 

administered in a convenient manner such as by injection (subcutaneous, intravenous, etc.), 
oral administration, inhalation, transdermal application, or rectal administration. Depending 
on the route of administration, the active compound may be coated in a material to protect the 
compound from the action of enzymes, acids and other natural conditions which may 
10 inactivate the compound. 

To administer a peptide having an activity of QmS I or Canf H by other than 
parenteral administration, it may be necessary to coat the peptide with, or co-administer the 
peptide with, a material to prevent its inactivation. For example, a peptide having an activity 
nf Can f I or ran f II may be administered to an individual in an appropriate carrier, diluent 
15 or adjuvant, co-admimstered wito enzyme 

liposomes. Pharmaceutically acceptable diluents include saline and aqueous buffer solutions. 
Adjuvant is used in its broadest sense and includes any immune stimulating compound such 
as interferon. Adjuvants contemplated herein include resorcinols, non-ionic surfactants such 
as polyoxyethylene oleyl ether and n-hexadecyl polyethylene ether. Enzyme inhibitors 
20 include pancreatic trypsin inhibitor, diisopropylfluorophosphate (DEP) and trasylol. 

Liposomes include water-in-oil-in-water CGF emulsions as well as conventional liposomes 
(Strejan et al. . (1984) 1 Neiimimmunol.. 2: 27). For purposes of inducing T cell 
nonresponsiveness, the composition is preferably administered in non-immunogenic form, 
e.g., one that does not contain adjuvant. 
25 The active compound may also be administered parenterally or intraperitoneally . 

Dispersions can also be prepared in glycerol, liquid polyethylene glycols, and mixtures 
thereof and in oils. Under ordinary conditions of storage and use, these preparations may 
contain a preservative to prevent the growth of microorganisms. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
30 solutions (where water soluble) or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersion. In all cases, the composition must be 
sterile and must be fluid to the extent that easy syringability exists. It must be stable under 
the conditions of manufacture and storage and must be preserved against the contaminating 
action of microorganisms such as bacteria and fungi. The carrier can be a solvent or 
35 dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, 
propylene glycol, and liquid polyetheylene glycol, and the like), suitable mixtures thereof, 
and vegetable oils. The proper fluidity can be maintained, for example, by the use of a 
coating such as lecithin, by the maintenance of the required particle size in the case of 
dispersion and by the use of surfactants. Prevention of the action of microorganisms can be 
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achieved by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many cases, it will be 
preferable to include isotonic agents, for example, sugars, polyalcohols such as mannitol, 
sorbitol sodium chloride in the composition. Prolonged absorption of the injectable 
5 compositions can be brought about by including in the composition an agent which delays 
absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating active compound (i.e., a 
* peptide having an activity ofCaafl or DmfH) in the required amount in an appropriate 
solvent with one or a combination of ingredients enumerated above, as required, followed by 
10 filtered sterilization. Generally, dispersions are prepared by incorporating the active 
T compound into a sterile vehicle which contains a basic dispersion medium and the required 

other ingredients from those enumerated above. m the case of sterile powders for the 
* preparation of sterile injectable solutions, the preferred methods of preparation are vacuum 
drying and freeze-drying which yields apowder of the active ingredient (i.e., at least one 
15 peptide having an activity of£an_f I or CanJII) plus any additional desired ingredient from a 
previously sterile-filtered solution thereof. 

When the peptide having an activity of Qml I or £anf II is suitably protected, as 
described above, the peptide may be orally administered, for example, with an inert diluent or 
an assimilable edible carrier. The peptide and other ingredients may also be enclosed in a 
20 hard or soft shell gelatin capsule, compressed into tablets, or incorporated directly into the 
individual's diet For oral therapeutic administration, the active compound may be 
incorporated with excipients and used in the form of ingestible tablets, buccal tablets, troches, 
capsules, elixirs, suspensions, syrups, wafers, and the like. The percentage of the 
compositions and preparations may, of course, be varied and may conveniently be between 
25 about 5 to about 80% of the weight of the unit. The amount of active compound m such 
^ therapeutically useful compositions is such that a suitable dosage will be obtamed. 

As used herein "pharmaceutically acceptable carrier" includes any and all solvents, 
dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption 
delaying agents, and the like. The use of such media and agents for pharmaceutically active 
30 substances is well known in the art. Except insofar as any conventional media or agent is 
incompatible with the active compound, use thereof in the therapeutic compositions is 
contemplated. Supplementary active compounds can also be incorporated into the 

compositions. . 

It is especially advantageous to formulate parenteral compositions m dosage unit form 
35 for ease of administration and uniformity of dosage. Dosage unit form as used herein refers 
to physically discrete units suited as unitary dosages for the mammalian subjects to be 
treated- each unit containing a predetermined quantity of active compound calculated to 
produce the desired therapeutic effect in association with the required pharmaceutical earner. 
The specification for the dosage unit forms of the invention are dictated by and directly 
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dependent on (a) the unique characteristics of the active compound and the particular 
therapeutic effect to be achieved, and (b) the limitations inherent in the art of compounding 
such an active compound for the treatment of sensitivity in subjects. 

The present invention also provides a composition comprising at least two peptides 
5 having an activity of Canf I or Caul II (e.g., a physical mixture of at least two peptides), 
each having T cell stimulating activity. For example, at least two peptides each having as 
activity of Canll can be combined or at least two peptides each having an activity of Canf II 
can be combined, or at least one peptide having an activity of Canf I and at least one peptide 
having an activity of Canf II can be combined and adrrunistered. Alternatively, a peptide 
10 having at least two regions, each havirig T ceU stimmating activity (i.e., each region 

comprising at least one T cell epitope) can be administered to an allergic subject. Such a 
peptide can have at least two regions derived from the same allergen, Canf I or Canf II, or a 
combination of Canf I and Canf II. A composition of two peptides or a peptide having at 
least two regions can be administered to a subject in the form of a composition with a 
15 pharmaceutical^ acceptable carrier as hereinbefore described. An amount of one or more of 
such compositions can be adimnistered simultaneously or sequentially to a subject sensitive 
to a dog dander allergen to treat such sensitivity. 

The cDN A (or the mRNA which served as a template during reverse transcription) 
encoding a peptide having an activity of Canf I or Canf II can be used to identify similar 
20 nucleic acid sequences in any variety or type of animal and, thus, to molecularly clone genes 
which have sufficient sequence homology to hybridize to the cDNA encoding a peptide 
having an activity of Canf I or Canf II. Thus, the present invention includes not only 
peptides having an activity of Canf I or Canf II, but also other proteins which may be 
allergens encoded by DNA which hybridizes to DN A of the present invention. 
25 Isolated peptides that are immunologically related to Canf I or Canf II, such as by 

antibody cross-reactivity or T cell cross-reactivity, other than those already identified, are 
within the scope of the invention. Such peptides bind antibodies specific for the protein and 
peptides of the invention, or stimulate T cells specific for the protein and" peptides of this 
invention. 

30 A peptide having an activity of Canf I or Canf II (i.e., Canf I or Canf II produced 

recombinantly or by chemical synthesis) is free of all other dog dander proteins and, thus, is 
useful in the standardization of allergen extracts which are key reagents for the diagnosis and 
treatment of dog dander hypersensitivity. In addition, such a peptide is of a consistent, well- 
defined composition and biological activity for use in preparations which can be administered 

35 for therapeutic purposes (e.g., to modify the allergic response of a subject sensitive to dog 
dander). Such peptides can also be used to study the mechanism of immunotherapy of Canis 
familiaris allergy and to design modified derivatives or analogs useful in immunotherapy. 

Work by others has shown that high doses of allergen extracts generally produce the 
best results during immunotherapy (i.e., best symptom relief). However, many subjects are 
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unable to tolerate large doses of such extracts due to systemic reactions elicited by the 
allergens and other components within these preparations. A peptide having an activity of 
CBLf I or Can! II of the invention has the advantage of being free of all other dander protein. 
Thus, such a peptide can be administered for therapeutic purposes. 
5 ' It is now also possible to design an agent or a drug capable of blocking or inhibiting 

the ability of a dog dander allergen to induce an allergic reaction in sensitive subjects. Such 
agents could be designed,for example, in such a manner that they would bind to relevant 
* anti-Canil or anti-CaniH IgE molecules, thus preventing IgE-allergen binding, and 
subsequent mast cell/basophil degranulation. Alternatively, such agents could bind to 
10 celTular components of the immune system, resulting in suppression or deserialization of the 
allergic responses to dog dander allergens. A non-restrictive example of this is the use of 
-- peptides including B or T cell epitopes of Canil or Canill, or modifications thereof, based 
on the cDNA protein structure of Canil orCanill to suppress the allergic response to a dog 
dander allergen. This could be carried out by defining the structures of fragments encoding B 
15 and T cell epitopes which affect B and T cell function in in vitro studies with blood 
components from subjects sensitive to dog dander. 

The invention is further illustrated by the following examples which should not be 
construed as further limiting the subject invention. The contents of all references and 
published patent applications cited throughout this application are hereby incorporated by 
20 reference. 

yr; y ^MPT.F. 1 : y r nt«n Sequence Analysis of Pnrififd Can fl 

Affinity purified Canil protein was obtained from Dr. Aalberse (de Groot, H. eLaL, 
supra) An Applied Biosystems Model 477A gas phase sequencer with on-line 

25 - phynylthiohydantoin (HTH) amino acid analysis (Model 120A) was used to sequence the 
purified CaiLf I protein. A modification of the extraction program, multiple butylchloride 
extraction, was used to improve me amino acid recovery. O-phthaladehyde (OPA) was used 
in blocking of primary amines when proline was located at the amino terminus. Brauer, 
AW sLsL, (1984) Rinchemistrv. 122: 134, 142. In situ alkylation was performed by 

30 using the non-nucleophilic reductant, tributylphosphine with concomitant alkylation by 4- 
vinyl pyridine in ethylmorphotine buffer. Andrews, P.C. and Dixon, J.E., (1987) AnaL 

Rinchemistrv. lfil: 524 - 528. 

Using this methodology, the sequence of the N^erminus of the Canil protein was 
determined, contrary to previous reports that the N-terminus is blocked (Schou, C. sUL, 
35 sum) The N-terminal sequence of 65 amino acid residues which was identified through 
multiple N-terminal sequence analysis in conjunction with OPA blocking of contaminating 
signal represents a novel protein sequence (Figure 5). The Canil protein sequence was 
confirmed and expanded by sequence analysis of CNBr cleaved peptides. In situ CNBr 
digestion of Canil on the sequence glass filter disk provided additional protein sequence 
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information. Simpson, R.J. and Nice, E.C., (1984) Biochem, International, &: 787 - 791 . 
Prior to the in situ CNBr cleavage, forty-four cycles of amino acid sequencing were 
performed and then the protein sample was treated with OPA to block all amino groups. 
After five hours of in situ CNBr digestion, three major sequences were identified 

5 corresponding to the sequences after Met46, Met30 and an unknown Met, later shown to be 
Metl03. An additional OPA block after cycle 1 8 (before Pro65) extended the sequence to 
Asp86. Sequence analysis of CNBr peptide fragments isolated by HPLC (Applied 
Biosystem, Inc., Model 130, C8 Column) further extended the N-terrninal sequence to 
ninety-four amino acid residues. In situ CNBr cleavage in conjunction with OPA blocking 

10 also identified a 39 amino acid residue peptide (residues 104-142). A potential N- 

glycosylation site was found in the cDNA deduced amino acid sequence, Asn54 - Ile55 - 
Thr56. The protein sequence analysis identified the Ile55 and Thr56 of-Canf I, however, 
nothing could be identified at the position 54. This suggests that post-translation 
modification occurs at Asn54 of Canf I and the modification is stable to the trifluoroacetic 

15 acid treatment during protein sequencing. 

FYAMPI.E ?r- F/rtractio n of mRNA From Canine Pgrotid Glands 
ffF iii Pinning of Can f I 
A pair of fresh parotid glands from a single outbred dog were obtained from the Tufts 
20 University School of Veterinary Medicine (Worcester, MA) and, washed in phosphate 
buffered saline, and immediately frozen on dry ice. RNA was extracted essentially as 
described in the literature (Chirgwin, J. M. eLaL, (1979) Biochemistry, JL£: 5294 - 5299.). 
One gland was pulverized to a powder with a mortar and pestle frozen in liquid N2, and 
suspended in 25 ml of GTC buffer (50% w/v guanidine thiocynate, 0.5% w/v Na lauryl 
25 sarcosine, 0.7% v/v B-mercaptqethanol, 0.1% v/v Sigma Antifoam A, 25 mM Na citrate, pH 
7.0) and vortexed until dissolved. Genomic DNA present in the solution was sheared by 
forcing the solution through a 1 6 gauge needle until the viscosity of the solution no longer 
decreased. The sheared solution was centrifuged at 3 K rpm for 5 minutes at room 
temperature. The supernatant was then sheared further through a 23 gauge needle until its 
30 viscosity no longer decreased, and cleared by centrifugation at 5 K rpm for 5 minutes at room 
temperature. The solution was layered onto a CsCi cushion (5.7 M CsCl, 10 mM EDTA pH 
7.5) and ultracentrifuged in a Beckman SW 41 Ti rotor at 35 K rpm for 1 6 hours at 20° C. 
The supernatant was discarded and the RNA pellet/vashed in 70% EtOH then resuspended in 
0.3 M NaOAc, 10 mM EDTA, 0.1% SDS. Two volumes of absolute EtOH were added, and 
3 5 precipitation carried out on dry ice. RNA was pelleted by centrifugation, 70% EtOH washed, 
and resuspended in TES (10 mM Tris, 1 mM EDTA, 0.1% SDS). The final yield was ~1 .8 
mg. 

Single strand total dog parotid gland cDNA was synthesized using the above RNA 
preparation as a template in reverse transcription. 4 ug of total RNA were EtOH. precipitated 
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(using glycogen as carrier: molecular biology grade Boerhinger Manheim), 70% EtOH 
washed, and resuspended in 10 ul of dH20. Oligo dT(12-18) was added to 50 ug/ml and the 
RN A denatured at 70° C for 5 minutes. The reaction was quick chilled on ice and 1 ul (40 
units) of RNAsin (Promega) was added as a prophylactic against contaminating RNases. The 
5 components from the BRL Superscript™ Reverse Transcriptase Kit were added as follows: 4 
ul 5 X buffer, 2 ul 0.1 M DTT, 1 ul 10 mM dNTP mix. After warming the reaction to 37° C, 
1 ul (200 units) of Superscript™ Reverse Transcriptase was added, and the reaction allowed 
" to .proceed for one hour at 37° C. Reverse transcription was tenninated by incubation at 70° 
C for 15 minutes, and the reaction stored at -20° C. 
10 - Initially the MOPAC (mixed oligonucleotides primed amplification of cDNA) 
technique of PCR amplification (Lee, C. C. sUl, (1988) 23* 1288 -1291) was 

*4used to obtain a partial cDNA of Can! I encoding amino acids 14 to 29 of the mature protein. 
Using dog parotid cDNA as a template with degenerate primer pairs (synthesized on an 
Applied Biosystems 392) based on residues 9 to 1 5 (SEQ ID NO:3) [Fig. 1 , .S 1 A (SEQ ID 
15 NO-4) or SIB (SEQ ID NO:5)] and 30 to 37 (SEQ ID NO:I0) [Figure 1, AS2A (SEQ ID 
NO l 1) or AS2B (SEQ ID NO:12)] of mature CanJ I, a DNA fragment of the predicted size 
(~3 X 28 amino acids, or 84 bp) could be amplified using a PCR kit (GeneAmp kit, Perkm 
Elmer Cetus, Norwalk, CT) in conjunction with the following program in an MJ Research 
Minicycler40X(92°C30secondV55 e Cl minutesr75 o Clminutes). The pnmer pair 5' 

20 S1B/3'AS2B amplified me predicted fra^^ 

both coding regions, the leucine residue was encoded by CTX rather than TT(A or G). As a 
test of its authenticity, the amplified fragment hybridized on a Southern blot to an internal 
degenerate oligonucleotide probe Dog probe 1 (SEQ ID NO:7), [based on Canf I residues 17 
to 24 (SEQ ID NO:6)] that had been end labeled with Y -32P ATP using T4 polynucleotide 
25 kinase After subcloning of the amplified fragment into Bluescript KS plasmid vector 
. (Stratagene, San Diego, CA), ft was sequenced using a Sanger dideoxy termination kit (USB 
Cleveland, OH) and shown to correctly encode residues 16 to 29 of mature Canf I. 

Similarly, when amino acid sequence analysis of purified Canf I yielded sequence 
information extending to residue 94 of the mature protein, new primer pairs were used in 
30 MOPAC PCR amplification of an extended partial Canf I cDNA (residues 14 to 87). The 5 
or sense primers SA (residues 14 to 20) and SB (residues 21 to 27) were a nested pair based 
on the known CmS I partial cDNA sequence, while the 3' or antisense primers, AS3 A (SEQ 
ID NO-13) (Figure 1) and AS3B (SEQ ID NO:14) (Figure 1) were degenerate 
oligonucleotides based on residues 88 to 94. In sequential rounds of PCR (1/lOOth of the 
35 first reaction was used as template for the second reaction) using conditions described above 
in a pair of successive reactions using nested 5' sense oligos in conjunction with a single 3 
antisense degenerate primer, a DNA fragment of the predicted size (-3 X 80 amino acids, or 
240 bp) could be amplified. Degenerate 3' antisense oligo AS3B was more efficient m 
collaborating with the successive pairs of 5' sense oligos to amplify the partial internal Canf 
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I cDNA than oligo AS3A, again suggesting that the leucine residue was encoded by CTX 
rather than TT(A or G). The 240 bp DNA fragment was subcloned into Bluescript KS 
plasmid vector and sequenced as described above. It too proved to be an authentic £an_f I 
cDN A. The missing residue in the amino acid sequence of £an_f I at residue 54 was 
5 determined to be an asparagine on the grounds that: 1) no amino acid signal was found at 
residue 54 during protein sequence analysis; and 2) the asparagine residue resides within a 
consensus sequence for N-linked glycosylation (N54 155 T56). These data strongly suggest 
that the N54 residue is modified by N-linked glycosylation. 

To obtain the 3' portion of the £ani I cDNA, the RACE (Rapid Amplification of 
10 cDNA ends) PCR protocol was employed (Frohman, M. A; fiLaL, (1988) Ems. 

■js fafl Acad. Sci.. 85: 8998 - 9002). First strand cDN A synthesis from total dog parotid RNA 
" was carried out as described above, except that the JM3 oligonucleotide was substituted for 
oligo dT as the primer in the reaction. The JM3 primer (SEQ ID NO:22) has an arbitrary 
tract of -40 nucleotides encoded 5' of an oligo dT tract (Figure 2). Hence, upon priming of 
1 5 poly A+ RNA td make cDN A, this known nucleotide tag is covalently linked to the 5' end of 
the nascent cDN A transcripts. Using nested 5' primers, SD (residues 73-79 (SEQ ID NO:15)) 
and SE (residues 80-86 (SEQ ID NO:17)), based on known Caiif I cDNA sequence from 
MOPAC PCR analysis and nested primers based on the known JM3 primer sequence (JM3-1 
(SEQ ID NO:23) and JM3-3 Bam (SEQ ID NO:24)) in PCR amplification as above (except 
20 the PCR program was 40 X [92° C 30 seconds/60 0 C 1 minutes/75° C 1 minutes]), a DNA 
fragment -500 bp in length was amplified. When probed against a kinase labeled degenerate 
oligonucleotide, Dog Probe 2 (SEQ ID NO:9) [residues 88 to 94 of mature £an_f I (SEQ ID 
NO:8)], this band proved positive for hybridization. Upon subcloning into plasmid vector 
and DNA sequence analysis, three different partial 3' Canfl cDNAs were identified: £an_fl 
25 (SEQ ID NO:61), 2£sn_f I (SEQ ID NO:62), and 3C_an_f I (SEQ ID NO:63), each as shown in 
Figure 9. 2Cajif I had a sequence that encoded a methionine residue followed by an 
asparagine-proline pair. These landmark residues for protein sequence analysis predicted: 1 ) 
a CNBr fragment with the NH2 terminal sequence MAKLLGRDPEQ...(SEQ ID NO:64); 2) 
an acid sensitive cleavage site at the DP pair, and 3) a proline residue which should prove 
30 refractory to OPA treatment and yield amino acid sequence data where all other NH2 termini 
would be blocked by the treatment Indeed, further protein sequence analysis of purified Can 
f I did identify a CNBr fragment that in conjunction with OPA blockage at the internal 
proline residue had the sequence t ' 

(M)AKLLGRDPEQSQEALEDFOEFSOAKGLNQEILELAQS(E)T (SEQ ID NO:65). Acid 
35 cleavage of the purified protein yielded a peptide with the sequence (D)PEQS(E)EA (SEQ ID 
NO:66). These complimentary data from protein sequence analysis and partial cDNA 
cloning of CaiLf I indicated that the authentic 3' end of the £an_f I cDNA may not have been 
isolated. 
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Comparison of the amino acid sequence data from sequencing purified £an_f I and 
those encoded by the partial cDNAs 2£aaf I and 3£aaf I inferred the origin of the multiple 
species of 3* cDNAs may have been alternative splicing of the nascent Can f I transcript (note 
how in partial cDNA 3 Can f I residues from the NH2 terminus of the CNBr fragment are 
5 found linked to the fragment's COOH terminal residues without the intervening residues). In 
contrast to this hypothesis of multiple cDNAs originating at the level of alternative splicing, 
the above PCR amplification of the 3' end of the Canf I cDNA produced a single prominent 
DN A fragment ~500 bp in length. However, the three partial 3 1 cDN As were either 
significantly longer or shorter than 500 bp. This suggested rare partial cDNAs were being 
1 0 >> > subcloned, perhaps because the authentic CaiLf I cDNA harbored the restrictions site(s) 

-encoded at the ends of the primers used in subcloning pfDNA fragments that arise from PCR 
^ amplification. Hence, when digesting the PGR product representing the Can f I 

cDNA with restriction endonucleases (in this case 5 f EcoR I and 3* BamH I) one would 1) cut 
the authentic Canf I cDNA into at least two pieces, and 2) bias towards subcloning rare 
15 cDNAs that had arisen from alternative splicing of the nascent £anf I RNA traraoripts that 
had exons containing EcoR I and/or BamH I sites deleted. To address this situation, new 
primers with different restriction enzyme sites at their 5' ends were synthesized and used in 
RACE PCR of the 3' end of the Can f I cDNA. The JM3-3 oligo was resynthesized with a 
Bgl II linked to its 5' end [JM3-3XB (SEQ ID NO:21)] (Figure 2), while the 5' primers SD 
20 and SE were resynthesized with Xho I sites at their 5' ends [XSD (SEQ ID NO:16) and XSE 
(SEQ ID NO: 1 8)] (Figure 2). After nested PCR of JM3 primed total dog parotid cDNA 
using these new primers and the previous amplification conditions, the intact 3' end of the 
Q&xS I cDNA (which hybridized to kinase labeled Dog Probe 4, [(SEQ ID NO:20), residues 
115-121 of mature Can f I (SEQ ID NO:19) (Figure 2)] was subcloned and sequenced. The 
25 translated amino acid sequence of the partial cDNA corresponded directly with the protein 
- sequence data and extended it a further 6 amino acids before encountering a stop codon. As 
the cloning artifacts had predicted, both EcoR I and BamH I sites were found in the coding 
region of the intact 3* £anf I cDNA. - 

The 5* end of the Can f I cDNA was cloned using an anchored PCR technique (Roux, 
30 K. H. and Dhanarajan, P., fl990^ Biotechniques . £: 48 - 57; Rafiiar, T. sLaL, (1991) J, Biol. 
Chem. . ?66 : 1229 - 1236). Double strand dog parotid cDNA was synthesized using a kit 
(BRL Superscript cDNA Synthesis System) employing the method of RNase H priming of 
the second strand of cDNA synthesis (Gubler, U., and Hoffman, B. J., (1983) Gene . 25: 263 - 
269). The blunt double stranded cDNA was ligate<i to an anchor adapter, thereby placing a 
35 known sequence at the 5' ends of cDNAs (SEQ ID NO:25; SEQ ID NO:26; and SEQ ID 

NO:27) (see Figure 3). A primer based on the anchor sequence was used as a 5' sense primer 
(AP) in conjunction with a nested pair of 3' antisense primers, ASA (SEQ ID NO:31) 
[residues 18 to 24 (SEQ ID NO:30)} and ASB (SEQ ID NO:33) [residues 25 to 30 (SEQ ID 
NO:32)] based on known Canf I cDNA sequence from MOPAC PCR in sequential rounds of 
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PCR (40 X [92° C 30 seconds/60 o C 1 minutes/75° C 1 minutes]) to amplify the 5' end of the 
gan_f I cDNA (1° reaction ds anchored cDNA template with 5' AP/3' ASB primers: 2° 
reaction 1/1 00th 1° reaction template with 5' AP/3' ASA primers). Agarose gel 
electrophoresis analysis of the 2° reaction revealed a broad band -300 bp in length, which in 
5 Southern blot analysis hybridized to a 32Pkinased degenerate oligonucleotide probe, Dog 
Probe 0 (SEQ ID NO:29) (Figure 3), based on residues 9 to 1 7 (SEQ ID NO:28) of mature 
£anfl. The amplified fragment was subcloned into Bluescript KS plasmid and subjected to 
DNA sequence analysis. Ifs authenticity as the 5' end of the £gn_f I cDNA was confirmed by 
the presence of the first 13 residues of mature Canf I protein at the 3' end of the partial 
10 cDNA. Sequence of the longest partial 5' cDNA extended a further 126 bp and encoded a 26 
amino acid leader sequence not found in mature £an_f I. Although no in-frame stop codons 
were found 5' of the presumed initiator methionine codon (M-26), it is presumed to be the 
true initiator codon and not just an internal methionine residue because: 1) it is embedded 
within a consensus sequence for translation initiation in mammalian cells (Kozak, M., (1986) 
1 5 Cell. M: 283 - 292); and 2) the predicted leader sequence is highly homologous to the leader 
sequences of proteins that are highly related to Canf I (see below). 

A contiguous Canf I cDNA was then amplified and both strands directly sequenced 
as a PCR product to confirm the coding sequence of the molecule. To minimize the 
possibility of introducing errors in the amplified cDNA during the PCR reaction, Pfu I DNA 
20 polymerase (Stratagene, San Diego, CA) was used to amplify the coding cDN A. Pfu I DNA 
polymerase has been documented to introduce an order of magnitude fewer errors thanTaq 
DNA polymerase during PCR (Lundberg, K. S., (1991) Gene, lllfi: 1 - 4). Direct sequencing 
of non-cloned DNA fragments from PCR reactions should also obviate any errors made by 
DNA polymerases during PCR since such errors will be scattered at random throughout the 
25 " population of PCR products (Gyllensten, U. B., and Ehrlich, H. A.. (1988) Proc . Natl . Acad, 
Sr.i. USA . S5_: 7652 - 7656). Primers used in the amplification/sequencing included the 5* 
sense leader ex oligo (SEQ ID NO:35) [residues -26 to - 20 of Canf I (SEQ ID NO:34)] and 
the 3' antisense stop Bgl II oligo (SEQ ID NO:36) [a 24-mer 40 bp 3' of the stop codon of 
£an_f I] (Figure 4). A program of 40 X (95° C 30 seconds/60 o C 45 seconds/75° C 45 
30 seconds) was used with the aforementioned primers and Pfu I DNA polymerase to amplify a 
DNA fragment -600 bp in length, which was subsequently isolated as a band on a 0.6% low 
melt agarose gel. This gel slice was melted at 70° C and used as template for PCR 
sequencing using 32P labelled oligonucleotides as primers and a commercially available kit 
(AmpliTaq Cycle Sequencing Kit, Perkin Elmer Cetus, Norwalk, CT). A program of 30 X 
35 (95° C 30 seconds/60 o or 68° C 30 seconds) was used for the cycle sequencing. The PCR 

sequencing strategy to obtain unambiguous sequence of the mature Canf I protein from both 
strands of the amplified cDNA is depicted in Figure 4 with the following sense primers: start 
ex (SEQ ID NO:37); SB (SEQ ID NO:38); SK (SEQ ID NO:39); SE (SEQ ID NO:40); and 
SH (SEQ ID NO:41), and the following antisense primers: Dog 9 (SEQ ID NO:42); ASK 
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yX AM PT F 4: F^ tT"- tinn nf wiWNA From Canine Parotid Glands 

ani l HtTT'"ff" ff:anfn 
The strategy used to clone £an_f II is schematically drawn in Figure 14. cDNA was 
synthesized using the above preparation as a template in reverse transcription. In the next 

5 step ds cDNA was used as a template for PCR along with degenerate primers which were 
designed based on amino acid sequence of £anS II and oriented to amplify a fragment of Can 
fllcDNA. PCR product was gel purified and than subjected to direct sequencing. The 
nucleotide sequence confirmed that the PCR product represents a fragment of £anf II cDNA. 
Further polymerase chain reactions were performed using £an_fH specific primers in order to 

10 obtain aTonger fragment which was subsequently used asa probe to screen a dog cDN A 
library. Positive clones were identified, plaque purified, sequenced and full length CsoJU 
cDNA was obtained. 

Fresh parotid glands from a single outbred dog were obtained from the Tufts 
University School of Veterinary Medicine (Worcester, MAX washed in phosphate buffered 

15 saline, and immediately frozen on dry ice. RNA was extracted essentially as described in the 
literature (Chirgwin, J. M. fiUL, (1979) Biochemistry, 1&: 5294 - 5299.). Two glands 
(approx. 50 g) were pulverized to a powder with a mortar and pestle frozen in liquid N2, and 
suspended in 25 ml of GTC buffer (50% w/v guanidine thiocynate, 0.5% w/v Na lauryl 
sarcosine, 0.7% v/v b-mercaptoethanol, 0.1% v/v Sigma Antifoam A (Sigma, St. Louis MO), 

20 25 mM Na citrate, pH 7.0) and vortexed until dissolved. Genomic DNA present in the 

solution was sheared by forcing the solution through a 16 gauge needle until the viscosity of 
the solution no longer decreased. The sheared solution was centrifuged at 3 K rpm for 5 
minutes at room temperature. The supernatant was then sheared further through a 23 gauge 
needle until its viscosity no longer decreased, and cleared by centrifugation at 5 K rpm for 5 

25 minutes at room temperature. The solution was layered onto a CsCl cushion (5.7 M CsCl, 1 0 
mM EDTA pH 7.5) and centrifuged in a Beckman SW 41 Ti rotor at 35 K rpm for 1 6 hours 
at 1 7° C. The supernatant was discarded and the RNA pellet washed in 70% EtOH then 
resuspended in 0.3 M NaOAc, 10 mM EDTA, 0. 1% SDS. Two volumes of absolute EtOH 
were added, and precipitation carried out on dry ice. RNA was pelleted by centrifugation, 

30 70% EtOH washed, and resuspended in TES (10 mM Tris, 1 mM EDTA, 0.1% SDS). The 
final yield was ~ 5.3 mg of total RNA. mRNA was isolated from total RNA by 
chromatography on oligo(dT) cellulose using the method described by Aviv, H. and Leder, P. 
nw Natl. Aca^Rci. USA . (1972) £2: 1408). 20 {ig of poly (A) RNA was obtained from 
1.7 mg of total RNA. 

35 The conversion of gland poly(A) mRNA into double stranded cDN A was carried out 

using standard procedure (Ausubel etal.. (1993) Current Protocols in Molecular Biology , 
John Wiley & Sons). First, poly(A)RNA was copied onto cDNA was using Amersham 
cDNA Synthesis System Plus according to the manufacturer's procedure. 4 ng of poly (A) 
RNA was used as a template and oligo dT(12-l 8) was used to prime first strand synthesis. 
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The RN A in RNA/DNA hybrid was than removed by RNaseH and the second strand was 
synthesized by DNA Polymerase I. Double stranded cDNA was completed and made blunt 
by T4 DNA polymerase and E. coli DNA ligase according to Gubler, U. and Hoffman, B.J., 
(1983) Gene, 25: 263). 

5 Initially, PCR amplification (Mullis, K.B. and Faloona, F., (1987) Methods Enzymol, 

155 - 355 . 36O) was used to obtain a partial cDNA of Cjnfll encoding amino acids 16 to 29 
(SEQ ID NO: 97) of the mature protein. Using dog parotid cDNA as a template with 
* degenerate primer pairs (synthesized on an Applied Biosystems 392) based on residues 3 to 6 
(S1A) (Figure 13) (SEQ ID NO: 91) and (SIB) (Figure 13) (SEQ ID NO: 92) and on residues 
10 - 33-to 38 (ASP2A) (Figure 13) (SEQ. ID NO: 93) and (ASP2B) (Figure 13) (SEQ ID NO: 94) 
< of mature Cjn_f II, a DNA fragment of the predicted size (-120 bp) was amplified using a 
PCR kit (GeneAmp kit, Perkin Elmer Cetus, Norwalk, CT). Conditions for the reaction 
were: denaturation for 1 minute at 94°C; annealing for 1 minute at 42°C and polymerization 
for 1 minute at 72°C. The cycle was repeated 30 times. As a test of its authenticity, the 
1 5 amplified fragment was subjected to direct sequencing using a commercially available kit 
(AmpliTaq Cycle Sequencing Kit, Perkin Elmer Cetus, Norwalk, CT) according to the 
instructions supplied. Primers used in the amplification/sequencing (which included SI A, 
SIB, ASP2A and ASP2B) had been end labeled with y-32P ATP using T4 polynucleotide 
kinase. The following program of 19 cycles (denaturation at 95°C for 1 minute; annealing 
20 at 50°C for 1 minute and extension at 72°C for 1 5 seconds) was used for the cycle 

sequencing in a MJ Research Minicycler. The nucleotide sequence of about 40 nucleotides 
of the fragment was shown to correctly encode residues 16 to 29 of mature Cjarif II (SEQ ID 
NO: 97). The missing residue in the amino acid sequence of a native protein at the position 
26 was found to be asparagine. 
25 ^; In order to generate a Canf ll specific probe long enough (>100bp) to be used to 
..screen a cDNA library, the 5' and 3' ends of the £anJf II cDNA were cloned using an 
•t anchored PCR technique (Roux, K. H. and Dhanarajan, P., (1990) Bjotechniques, £: 48 - 57; 
, Rafnar., T. gLaL, (1991) T Biol. Chem. . 266: 1229 - 1236) (Figure 15). Double stranded dog 
cDNA was synthesized as described above. The blunt double stranded cDN A was than 
30 ligated to an anchor adapter AT/AL (Figure 16; SEQ ID NO: 96 and 102) thereby placing a 
known sequence at 5' and 3' ends of cDNAs (Figure 15 A). In order to obtain the 5' end of 
Can f II cDNA, a primer based on the anchor sequence AP2 (Figure 16) (SEQ ID NO: 95) 
was used as a 5' primer in conjunction with 3' antisense primers, D2-1 (residues 22 to 30) 
(Figure 13 and 16) (SEQ ID NO: 74), D2-2 (residues 17 to 25) (Figure 13 and 16) (SEQ ID 
35 NO: 75) and D2-3 (residues 16 to 21 ) (Figure 13 and 1 6) (SEQ ID NO: 76) based on known 
Canf ll cDNA sequence obtained from initial PCR. Sequential rounds of PCR (40 X [92° C 
30 seconds/60 o C 1 minute/75° C 1 minute]) were carried out to amplify the 5' end of the Can 
f II cDNA. In the 1° reaction, double stranded anchored cDNA was used as a template along 
with 5' AP2/3' D2-1 primers; in the 2° reaction l/20th of the 1° reaction mixture was used 
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with 5' AP2/3' D2-2 primers; in the 3° reaction l/20th of 2° reaction mixture was used with 
AP2/D2-3 primers). 1% agarose gel electrophoresis of the reaction products revealed the 
presence of a single band of- 300 bp long. As expected from the position of primers (see 
Figure 1 5), the 2° and 3° reactions products migrated faster than 1 ° reaction product. The 
5 amplified fragment from the 3° reaction was gel purified and subjected to DNA sequence 
analysis. It's authenticity as the 5' end of the Canl II cDNA was confirmed by the presence 
of the first N-terminal residues of mature Qanl II protein (Figure 15A, shaded residues). The 
* 5* portion of cDNA (Figure 15A) (SEQ ID NO: 98) encoded part of the amino acid signal 
sequence which was not found in mature £an_f II. The 3' portion of Canf II (Figure 15B) 
1 0 (SEQ ID NO: 99) was synthesized in an analogous manner as the 5' end except that single 
stranded cDNA was used as a template and APA (Figure 16) (SEQ ID NO: 100) was used as 
a 3' primer and D2-4/SEQ ID NO: 77), D2-5 (SEQ ID NO: 78) and D2-6 (SEQ ID NO: 79) 
(Figures 14 and 16) were used as internal primers in PCR. Direct sequencing of the PCR 
product from the 3° reaction revealed the presence of 8 amino acids of the known Cj&nf II 
15 sequence followed by 8 amino acids downstream of the known sequence. 

In order to clone the full length Canfll cDNA, a cDNA library was prepared and 
screened using standard published procedures (Gubler and Hoffman, Ausubel at al., supra). 
The lambda cDNA library was custom made by Clontech Laboratories, Inc. as follows: the 
first strand cDNA was primed from poly(A) KNA by oligo d(T)15. The blunt ended double 
20 stranded cDNA was ligated to an Eco RI linker CCGGAATTCCGG (SEQ ID NO: 101), 

digested with EcoRI, size selected in order to obtain fragments larger than 500bp, and ligated 
into EcoRI cut and dephosphorylated vector \gt 10. The DNA was then packaged into 
lambda particles, plated on C600-hfl and C-600 E.Coli strains and the library titer was 
determined. The unamplified library consisted of 1.53x106 independent clones (clear 
25 plaques on C-600 hfl host) which contained inserts ranging in size from 0.6kb to 3-4kb. The 
average size of the insert as determined by PCR using Clontech XgtlO primers was 1 .2 kb. 
100,000 clones were plated oh C-600hfl host and screened using £an_f ll specific probe. All 
manipulations leading to the cloning and sequencing of £an_f II cDNA were done according 
to Pmtnrnk in MoWnlar Biology (Ausubel £LaL supra)- A CsnfH probe was obtained by 
30 PCR amplification of dog cDNA using D2-9 (SEQ ID NO: 80) and D2-13 (SEQ ID NO: 83) 
primers (Figure 16). The PCR product was then 32 P labeled by random priming. 20 positive 
clones were plaque purified, phage DNA was extracted from individual clones, digested with 
EcoRI and subcloned into pUC18. The presence of inserts was verified by digestion of the 
plasmid DNA with EcoRI and the nucleotide sequence of three individual clones was 
35 determined using Sequenase (United States Biochemicals) and AmpliTaq Cycle Sequencing 
kit (Perkin Elmer Cetus, Norwalk CT) according to manufacturer's instructions. PCR cycle 
sequencing analysis served to resolve some DNA sequence ambiguities resulting most 
probably from the formation of secondary structures on GC-rich £an_f II template. The 
sequencing strategy is depicted in Figure 17. Primers used in the sequencing/amplification 



WO 94/16068 

-36- 



™iiv available 16-mer Reverse Sequencing Primer (-21) and 17-mer 
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WO 94/16068 



PCT7US93/12468 



-37- 



'5 



signal sequence and does not show any similarity to any known signal sequence. The £an_f 
II cDNA encodes a protein having a predicted molecular weight of 1 8.2 kDa, with a single 
potential N-linked glycosylation site. Because 1) no amino acid signal was found at residue 
25 during protein sequence analysis, and 2) the asparagine residue resides within a consensus 
sequence for N-linked glycosylation (N26 K27 S28), these data strongly suggest that the N26 
residue is modified by N-linked glycosylation. N-linked glycosylation may increase the 
molecular weight of the mature protein. The deduced amino acid sequence of the mature 
* protein encoded by the nucleic acid sequence is identical to the known NH2-terminal and 
internal amino acid sequence determined by amino acid sequence analysis of purified Cjffl_f II 
10 profein conducted as described in Example 3. 

The expression of £an_f n in various tissues was studied using the Northern blot 
technique. Poly (A) RNA or total RNA from various tissues was separated by 
electrophoresis through a 1 .5% agarose gel containing 2.2 M formaldehyde. (Ausubel sLsL; 
Sipm). After electrophoresis, the separated RNAs were transferred onto GeneScreen 
15 membrane (NEN). Transfer, hybridization with a 32P labeled £arif II probe [obtained by 
PCR mediated amplification using D2-9 (SEQ ID NO: 14) and D2-13 (SEQ ID NO: 83) 
(Figure 1 6) primers] and washings of the filter were performed according to the 
manufacturer's instructions. It appeared that the £ml II probe hybridized specifically at high 
stringency to RNA from dog parotid gland and to RNA from tongue epithelial tissue (Figure 
20 20). It did not hybridize to RNA from liver, or submaxiliary gland. Hybridization was 
observed to two bands of about 800 bp and 900 bp long, suggesting that £an_f II may be 
encoded by two mRNA species. It is unlikely that two RNAs are transcribed from two 
different genes since a Southern blot experiment suggested that only a single copy Can_f II 
geneis present in the dog genome. The two mRNAs encoding Canf II may be due to 
25 alternative splicing or to degradation of the mRNA. The former possibility seems very likely 
since different splicing configurations in the 3' noncoding region has been described for 
proteins which are similar to Cmf II (Clark eUL, (1984) EMBOL, 3_: 1045 - 1052, see also 
below). 

In order to infer the possible biological function of £anf II, its amino acid sequence 
30 was compared to those in the GenBank, GenBankUpdate, EMBL, and EMBL Update 

sequence data bases using the NCBI BLAST network service (Altschul, S. F., et_aL, (1990) L 
Mol Biol. . 215: 403 - 41 0). Cjmf II precursor protein displayed high similarity to two groups 
of related proteins: 1) Mouse Urinary Proteins (MUPs) (Figure 21) (SEQ ID NO: 90) and 2) 
urinary a-2-globulins of rat (A2U) (Figure 21) (SEQ ID NO: 89). The sequences of MUPs 
35 and A2Us show them both to be members of the lipocalin protein family (Cavaggioni sLaL 
(1987) FFBS I^tt.. 212 : 225 - 228). These are small proteins capable of binding 
hydrophobic molecules with high affinity and selectivity. This family now contains over 20 
different proteins, principally identified through sequence homology (Flower SLaL, (1991) 
Rionhv* Re*. Commun. . I£0_: 69 - 74). The function of MUP and A2U remains 
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unclear, but it is proposed that rodent urinary proteins are responsible for binding 
pheromones and their subsequent release from drying urine (Bocskei eLsl, (1992) Nature, 
360 : 1 86 - 188). They are synthesized at different levels in the liver and in the submaxillary, 
lachrymal, sublingual, parotid and mammary glands (Shahan £LaL, (1987) Mol. Cell. Biol,, < 
5 7: 1947 - 1 954). MUP IV for example, is expressed predominantly in the lachrymal and 
parotid glands, but not in liver (Shahan et al.. aipn). The amino acid similarity of £an_f II, 
MUP and A2U as well as their pattern of expression may indicate that £an_f II is canine 
homolog of lipocalins. Interestingly, immunologic and biochemical studies of MUPs and 
MUP-related proteins have shown that these proteins are important human allergens (Lorusso 
in ^ fiog*) ,T Ml^rpv Clin Immunol.. 78: 928: Platts-Mills etal.. (1987) J. Allergy Clin. 
Immunol.. 22: 505; Gurka sLsL, (1989) J. Allergy Clin. Immunol., 32: 945 - 954). 

FYA1VTPI,F St Bacterial E xpression of Can f I 

Bacterial expression of Can f I was performed as follows. The vector pETl Id AHR 

1 5 His6 (Novagen, Madison, WI; modified at ImmuLogic Pharmaceutical Corporation by J. P. 
Morgenstem) was modified for expression of Cm! I in £ coli, by removal of the internal 
EcoR I restriction site (at residues E143F144) from the £auf I cDNA to be inserted in the 
vector. This modification was necessary since all DNA fragments in this vector are cloned in 
frame with the His6 NH2 terminal leader sequence at a mutual 5' EcoR I site. Hence, EcoR I 

20 sites internal to the insert must be avoided. The pETl 1 d AHR His6 vector also requires that 
inserts have a 3' BamH I site. However, since restriction sites such as Bgl II and Bel I are 
compatible with BamH I overhangs, they could be placed at the 3' end of the Qsal I cDNA, 
avoiding the need to mutate the internal BamH I site. A cDNA encoding the mature Can fl 
protein had its internal EcoR I site removed, a unique EcoR I site placed at its 5' end, and a 

25 Bgl II site placed at its 3' end in a two step PCR reaction (Ho elaL, supra) using Pfu I DNA 
.... polymerase to minimize errors during amplification (Figure 6). Two halves of the C_an_f I 
cDN A were amplified in primary PCR reactions (template: PCR fragment from cycle 
sequencing, program: 40 X [95° 30 seconds/60 o C 45 seconds/75° C 45 seconds]) with the 5' 
portion of the molecule being amplified with the Start ex (SEQ ID NO:46 and SEQ ID 

30 NO:47)/EF antisense (SEQ ID NO:50 and SEQ ID NO:48) primer pair and the 3* portion 
amplified with the sense EF (SEQ ID NO:49 and SEQ ID NO:48)/TAG Bgl II (SEQ ID 
NO:5 1 and SEQ ID NO:52) primer pair. Both EF primers were designed to introduce a point 
mutation in the EcoR I site at residues E143F144 ofCaaf I from GAATTC to GAGTTC, 
which would maintain the El 43 residue since glutamate can be encoded by GAA or GAG 

35 codons. 

Amplified DNA fragments of the expected size were isolated in gel slices from a 
0.6% low melt agarose gel, melted at 70° C, mixed and used as template in a secondary (2°) 
PCR reaction with Start ex and TAG Bgl II primers. Mutagenized regions bearing the 
E143F144 pair should hybridize in the initial stages of the reaction to link the 5' and 3' ends 
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of the Canf l cDNA, while the extreme 5' and 3' primers should serve to amplify the intact 
mutagenized cDN A. The entire reaction was phenol/chloroform extracted, EtOH 
precipitated, 70% EtOH washed, and digested with EcoR I and Bgl II. A band of the 
expected size (-450 bp) was isolated as a gel slice from a 0.6% low melt agarose gel, melted 
5 at 70° C, ligated at room temperature to EcoR I/BamH I digested pET 1 1 dAHR His 6 

plasmid, and the ligation transformed into XL-1 bacteria (Stratagene). Miniprep analysis of a 
3 ml culture of one of the transformed colonies (using a Qiagen [Foster City, CA] plasmid 
mini kit) by Eco RV digestion revealed the presence of an insert of the appropriate size 
within the expression vector. A 300 ml culture seeded with this colony was grown, plasmid 
10 DNA extracted (Qiagen plasmid midi kit) and subjected to DNA sequence analysis. The 
entire 453 bp insert was shown to have the correct sequence for mature £gn_f I cDNA 
(including the mutated E143 codon from GAA to GAG), with the addition of an in-frame 
His6 reporter group (SEQ ID NO:53) encoded at its 5' end. This His6 reporter group was to 
be used in metal ion affinity purification of the recombinant protein using NTA Ni++ 
15 chelating resin (Qiagen; Hochuli el^L supjs). 

A single colony of BL21(DE3) pET 1 Id AHR His6£arif IdRI bacteria was inoculated 
into a 2 ml brain heart infusion (BHI) culture (+200ug/ml ampicillin) and incubated at 37° C 
until turbid but not saturated. At this point 6 ul was removed and added to 600 ul of BHI and 
mixed. 100 ul was spread onto each of 6 BHI agar plates (+200 ug ampicillin) and incubated 
20 overnight at 37° C. The next morning the bacterial lawn was scraped off of the plates, pooled 
and resuspended in 20 ml of BHI media, and then aliquoted 1 ml each into each of 18 500 ml 
. BHI cultures (+200 jig/ml ampicillin) in 2 liter Ehrlenmeyer flasks. Cultures were incubated 
at 37° and shaken at 300 rpm until the A600 reached 1.0. Isopropyl-B-D- 
thiogalactopyranoside (IPTG) was than added to final concentration of 1 mM to induce 
25 expression of the T7 RNA polymerase gene which would in turn induce expression of 
His6C_an_f I protein from the hybrid T7 gnl 0/lac 0 promoter. Expression was allowed to 
proceed for 2 hours after which the bacteria were pelleted and resuspended in 6 M guanidine 
hydrochloride (GuHCl), 100 mM NaPC4, 10 mM Tris, 100 mM 2-mercaptoethanol pH 8.0. 
Extraction was carried out for 1 hour with vigorous shaking and terminated by pelleting of 
30 the insoluble material at 10 K rpm in a JA-10 rotor (Beckman) for 1 hour. Supernatant was 
removed, and its pH adjusted to 8.0 before loading onto a 50 ml NTA agarose column that 
had been equilibrated in 6 M GuHCl, 100 mM NaPCM, 10 mM Tris, pH 8.0. The column 
was washed by step gradient as follows: 1) 6 M GuHCl, 100 mM NaP04, 10 mM Tris, pH 
8.0, 2) 8 M urea, 100 mM NaP04, 10 mM Tris, pH 8.0, 3) 8 M urea, 100 mMNaOAc, 10 
3 5 mM Tris, pH 6.3 with each wash proceeding until the A280 of the effluent from the column 
reached background. Recombinant His6£ajill protein was eluted from the column with 8 M 
urea, 100 mM NaOAc, 10 mM Tris, pH 4.5. Yield of the pooled peak fractions was -100 mg 
with a purity of -80% as determined by densitometry of a sample of the material analyzed by 
SDS-PAGE. 



WO*4/16*» ^ 

E coH transformed with the vector pETl Id containing the nucleic aeid encoding Cm 
f I have been deposited with the ATCC at accession number 69167. 

nf il THTT if ft Mnmr" ""■ Einrnilwi "f Cm f I Prote i n 
. To pirapo^ly glycosylated form of recombinant Canil proteu, expessed ,n 

mao.naliancdls.Cmflexr-ssionwasc.rriedoutasfoHows, Full length £mf I protem 
(mcluding the leader sequence not found in the mature protein) when express* « 

h^htevel transient expression of recombinant £an_f I were employed. Ftr*trans,en 

^^X»e°»™3T3ce..sus^ 

Bgl II (SEQ ID NO:51 and SEQ ID NO:52) primer patr (s« Figurep.The entire PCR 
^onw^pheno^Morofbrmexr^E^ 

.^^tare (see Fieure 7). The ligation was transformed tnto competent XL-1 Blue t. 
roU^d^^Tcdories selected on ampiciUin (200 ugfal) dishes. DNA sequence analysis 

25 coiodesCusinga^plasrnidminiki,). Both plasmids had inserts wtm the correct 
^auenceoftheS'andS'endsoffulllengthCanil- 
^ex^toaidmpurin^ 

reporter group was to be fused at its COOH terminus. This was accomphshed by exctsmg 

protein mat had been modified with the addition 6 htsttdmes (Ftgure 8) The COOH temunal 
follows- a sense oligonucleotide (SEQ ID NO:56) encodmg restdues E123 to QMS of the 

oligonucleotideencodingresiduesEHl toQMS/affisetract^pcrfonCS 

SEO ID NO-59); and 3' His6 TAG BgUI (SEQ ID NO:60), were synthesrzed and punned 
SSU* (W- Biosystems, Fosfcr City, CA). In addition, 
^primers composed of me firs, 24 nucleotides of the aforementioned ohgonuc.eot.des, 
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5' His6 link and 3* His6 link, were also synthesized. Linking and amplifying the two long 
oligonucleotides to generate the EcoR I - Bgl II DNA fragment encoding the £an_f I COOH 
tenninus-His6 fusion was performed by PCR. 10 pmoles of each large oligonucleotide were 
used as substrate in Pfu I mediated PCR with 1 uM primers using the program 40 X (95° C 

5 30 seconds/60 o C 45 seconds/75° C 30 seconds). The entire PCR reaction was phenol- 
chloroform extracted, EtOH precipitated, 70% EtOH washed and digested with EcoR I and 
Bgl II to generate correct overhangs for insertion into pJ7Q£an_f I. A band of the expected 
size (-110 bp) to encode the Caaf I COOH terrninus/His6 fusion was isolated as a gel slice 
on a 2.0% NuSieve agarose gel, melted at 70° C and ligated to EcoR I/Bgl H digested pJ7Q 

10 Canf I at room temperature. The ligation was transformed into competent XL-1 (Blue E. coli 
and positive colonies selected on ampicillin (200 ug/ml) dishes. Plasmid was isolated from 4 
cultures inoculated with different colonies and subjected to DNA sequence analysis at the 3* 
end of the insert. Clone #3 contained the expected His6 residues linked in frame to the 
COOH terminus of Canf I, so a large scale grow of this culture was undertaken to obtain 

15 large quantities of the pJ7Q £an_f I His6 plasmid for transfection. A one liter culture was 
amplified in 15 uM chloramphenicol once A600 reached 0.6. 800 ug of plasmid were 
isolated after alkaline lysis and two successive rounds of CsCl banding (Sambrook SLaL, 
supra V 

Ten plates ofNIH 3T3 cells were seeded at a density of 1.7 X 106 cells per 15 cm 
20 tissue culture dish, and the following morning, subjected to calcium phosphate transfection 
with 20 ug/dish of pJ7£i Canf I His6 plasmid (Parker, B. A., and Stark, G. R., (1979) 
J. Virol.. 3_L: 360 - 369). 48 hours post transfection, supernatant was pooled from the dishes, 
filtered through a 0.45 a unit (Costar), and brought to a concentration of 1 uM imidazole, 
with the addition of protease inhibitors 1 mM PMSF, 1 ug/ml, pepstatin 1 ug/ml soybean 
25 trypsin inhibitor, and 1 ug/ml leupeptin. Metal ion affinity purification of the £an_f IHis6 
protein was achieved by loading the supernatant onto a 2 ml NTA agarose (Qiagen) column 
that had been equilibrated in 1 X PBS, 1 mM imidazole, 1 mM PMSF, 1 ug/ml pepstatin, 1 u 
g/ml soybean trypsin inhibitor, and 1 ug/ml leupeptin. Non-specifically bound proteins were 
washed off the column with 10 column volumes of 1 X PBS, 20 mM imidazole, ImM PMSF, 

30 1 ug/ml pepstatin, 1 Ug/ml soybean trypsin inhibitor, and 1 ug/ml leupeptin. Can f I His6 

protein was specifically eluted from the column in 1 X PBS, 80 mM imidazole, 1 mM PMSF, 
1 ug/ml pepstatin, 1 Ug/ml soybean trypsin inhibitor, and 1 ug/ml leupeptin (Hoffmann, A. 
and Roeder, R. G., (1991) N,.r. Acids. Res.. 19_: 6337 - 6338, and Janknecht fiLaL, sipm). 
Aliquots of the eluted fractions were analyzed by 12% SDS PAGE. Coomasie blue staining 

35 of the gel revealed three major bands of molecular weight -70 kDa, 45 KDa, and 25 KDa. 

Since the molecular weight of native inununoaffinity purified Can f I is 25 KDa (Schou sLflL, 
supra and de Groot et al.. supra) it was suspected that the smallest band on the SDS gel was 
recombinant Can f I His6. 
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FX AM PT F 7; Pirg f t P i P Hin ff Human IeF, to Recombinant Can f I 

fry F.I .IS A an fl Western blot 
An ELISA plate (IMMULON II Dynatech, Chantilly, V A) was coated with 
bacterially expressed recombinant Cani I, (rCani I) at 0.5 ug/well in PBS-Tween and 
5 incubated overnight at 4°C. The coating antigen was removed and the wells were blocked 
with 0 5% gelatin in PBS, 200 ul/well for two hours at room temperature. Plasma from a 
skin test positive dog allergic patient, #901, was serially diluted with PBS-Tween and 100 ul 
* was added per well and incubated overnight at 4°C (the plasma dilutions were tested in 
duplicate). The second antibody (biotinylated goat anti-human IgE 1:1000, Kirkegaard& 
10 Perry Laboratories) was added at 100 ul/well for one hour at room temperature. This 
Ablution was removed and streptavidin-HRPO at 1 :10,000, (Southern Biotecnology 
«^ssociates, Inc. Biraungham, AL) was added for one hour at room tmperature. TMB 
Membrane Peroxidase Substrate system (Kirkegaard & Perry Laboratories) was freshly 
mixed and added at 100 pi. The color was allowed to develop for 2-5 minutes. The reaction 
was stopped by the addition of 100 ul/well of 1M phosphoric acid. The plate was read on a 
Mircoplate EL 310 Autoreader (Biotek Instruments, Winooski, VT) with a 450 nm filter. 
The absorbance levels of duplicate wells were averaged. The graphed results are shown in 
Figure 11. The data shows that patient #901 has a high level of anti-Caai specific IgE such 
that at 1/162 dilution (the highest plasma dilutin used) me binding level to the recombinant 
Caaf I is still two-fold above background. A known negative patient (#250) was also tested 
and shown to be negative by this assay. 

Western Blot analysis of four different protein preparations as potential sources Caul 
I was performed. The four different preparations used for Western blotting were: dog hair 
extract, dog saliva, bacterially expressed rCaaf I (used for the ELISA) and rCaaf I as 
25 expressed in a roammalian cell culture system. These preparations were loaded on a 15% 
. acrylamide SDS-PAGE (lanes T-4, respectively) at 5 ug/lane. The protein concentrations 
• were based on the Bicmchoninic acid (BCA) assay (Pierce, Rockford, IL). Following 
electrophoresis, the proteins were transferred to nitrocellulose and stained with India Ink. 
The nitrocellulose sections were blocked by incubation in Tween solution with 1% milk/1% 
30 BSA for 30 minutes room temperature, then probed with patient #901 plasma or negative 
control patient #250 at a 1 :20 dilution in tween mild solution. This first antibody incubation 
was carried out overnight at room temperature. Biotinylated goat anti-human IgE (KPL) was 
used as the second antibody at a 1:5000 dilution for a two hour incubation. Streptavidin- 
HRPO (1:20,000 dilution) and the ECL Western Blot Detection system (Amersham, 
35 Arlington Heights, IL) were used for detection by chemiluminescence. A 20 second 
exposure was performed and the film developed. The results from this assay show no 
recognition of the protein preparations by patient #250 IgE. The IgE from dog allergic 
patient #901 shows distinct binding to £an_f I proteins in the saliva and the bacterially 
expressed recombinant £an_f I (lanes 2 and 3, Figure 12). The sizes of the protein forms are 



20 
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different between the two preparations and this is due to fact that the native £an_f I protein 
found in dog saliva is glycosylated and runs with an apparent molecular weight of 28,000 
daltons whereas the recombinant form from bacteria has no carbohydrate modification. The 
binding of IgE from serum of patients #901 to the mammalian expressed r£an_f I is 
5 extremely faint and at present is only suggestive of positive expression. The full length 
bacterially produced form, lane 3, has an apparent molecular weight of 18,000 daltons and 
the larger IgE binding proteins in both lanes 2 and 3 are most likely dimeric structures of the 
lower molecular weight proteins. 

10 fl ylMPT F, ft notarial F.Tnression of Can f II 

In an attempt to readily produce large amounts of pure recombinant Qsol II protein, 
expression of £an_f II in bacteria was carried out as follows. Full length cDNA encoding 
mature £an_f II protein was obtained by amplification of the molecule from total parotid 
cDNA in PCR using D2-3pet (SEQ ID NO: 103) and D2-5pet (SEQ ID NO: 104) primer pair 
1 5 (Figure 1 3). Primers were designed to introduce EcoRI and BamHl restriction sites at 5' and 
3' of cDNA molecule respectively. The pETl Id AHR His6 vector requires that inserts have 
5' EcoRI site and 3' BamHI site. 

Amplified DNA fragment of the expected size was purified by electrophpresis in low 
melting agarose and ligated at room temperature to EcoRI/BamHI digested pETl Id AHR 
20 His6 plasmid and the ligation mixture was used to transform XL-1 Blue bacteria (Stratagene). 
Miniprep analysis of several transformed colonies (using Qiagen [Foster City, CA] plasmid 
mini kit) revealed the presence of an insert of the appropriate size within the expression 
vector. A 300 ml culture inoculated with one colony was grown, plasmid DNA extracted and 
subjected to sequence analysis. The entire 486 bp insert was shown to have the correct 
25 sequence for the mature £an_f II cDNA with the addition of a reporter group encoded at the 5" 
end. This His6 reporter group was to be used in metal ion affinity purification of the 
recombinant protein using NTA Ni++ chelating resin (Qiagen). A single colony of 
BL21 (DE3) pET 1 1 d AHR His6Can_f II bacteria was inoculated into a 2 ml brain heart 
infusion (BHI) culture (+200ng/ml ampicillin) and incubated at 37° C until turbid but not 
30 saturated. At this point 6 ul was removed and added to 600 ul of BHI and mixed. 100 ul 
was spread onto each of 6 BHI agar plates (+200 ug ampicillin) and incubated overnight at 
37° C. The next morning the bacterial lawn was scraped off of the the plates, pooled and 
resuspended in 20 ml of BHI media, and then aliquoted 1 ml each into each of 1 8 500 ml 
BHI cultures (+200 ug/ml ampicillin) in 2 liter Ehrlenmeyer flasks. Cultures were incubated 
35 at 37°C and shaken at 300 rpm until the A600 reached 1 .0. Isopropyl-B-D- 

thiogalactopyranoside (IPTG) was than added to 1 mM to induce expression of the T7 RNA 
polymerase gene which would in turn induce expression of His6£an_f II protein from the 
hybrid T7 gnl 0/lac O promoter. 
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Expression was allowed to proceed for 2 hours after which the bacteria were pelleted 
and resuspended in 6 M guanidine hydrochloride (GuHCl), 100 mM NaPCM, 10 mM Tris, 
100 mM 2-mercaptoethanol pH 8.0. Extraction was carried out for 1 hour with vigorous 
shaking and terminated by pelleting of the insoluble material at 10 K rpm in a JA-10 rotor 
5 (Beckman) for 1 hour. Supernatant was removed, and its pH adjusted to 8.0 before loading 
onto a 50 ml NTA agarose column that had been equilibrated in 6 M GuHCl, 100 mM 
NaP04, 10 mM Tris, pH 8.0. The column was washed by step gradient as Mows: 1) 6 M 
• GuHCl! 100 mM NaPCM, 10 mM Tris, pH 8.0, 2) 8 M urea, 100 mM NaPCM, 10 mM Tris, 
pH 8.0,' 3) 8 M urea, 100 mM NaOAc, 10 mM Tris, pH 6.3 with each wash proceeding until 
10 the-A280 of the effluent from the column reached background. Recombinant His6£auf II 
protein was eluted from the column with 8 M urea, 100 mM NaOAc, 10 mM Tris, pH 4.5. 
^Yield of the pooled peak fractions was-100 mg with a purity of~80% as determined by 
densitometry of a sample of the material analyzed by SDS-PAGE. 

E. coli transformed with the vector pETl Id containing the nucleic acid encoding Can - 
15 fil have been deposited with the ATCC at accession number XXXX. 

F yampi f. 9: 1**"* Rindin g »f Hnman IpR to Native and Recombinant Can f II 

Plasma samples from 14 dog-allergic patients (skin test 4+) were assayed for IgE 
binding to native Cjffllll and rCjmf H. An ELISA plate (Immulon II Dynatech, Chantilly, 

20 VA) was coated with native and bacterially expressed recombinant £ml U at 0.5 ug/well in 
PBS-Tween and incubated overnight at 4° C. The coating antigen was removed and the wells 
were blocked with 0.5% gelatin in PBS, 200 ul/well for two hours at room temperatre. 
Binding of human IgE to the coating antigen was detected using biotinylated goat-anti- 
human IgE, streptavidin linked to peroxidase and TMB substrate. Reactions were read on a 

25 plate reader at 450 nm (A450). Of 14 plasma samples tested for IgE to native Caul II, 5 
'contained detectable antibody binding (Figure 22A), with plasma from patients #901 and 
-#227 containg the highest levels. Similarly, of 23 plasma samples tested for IgE to 
recombinant, bacterially expressed fiattf n, several contained detectable antibody binding 
(Figures 22B and C). 

30 

r ,X Afr py F in? r M f I Hnman T Cell Proliferation Analysis 

To identify peptides having Can! I T cell stimulating activity several peptides derived 
from Canfl were produced and cultured with human T cell lines primed with recombinant 
Csnil protein and the responses were determined by standard T cell proliferation assays. A 
35 set of peptides derived from Cmf I (Construct 1 (SEQ ID NO:105), Construct 2 (SEQ ID 
NO: 106), and Construct 3 (SEQ ID NO:107)), each representing a portion of the Caaf I 
protein were used in proliferation assays. In addition, the assays included two peptides 
(A0095 (SEQ ID NO:108), amino acids 7 through 19; and A0096 (SEQ ID NO:109); amino 



WO 94/16068 



-45- 



PCTAJS93/12468 



acids 42 through 54) which were selected as containing potential T cell epitopes using the 
algorithm described in Hill et al., Journal oflmmunologY 141: 1 84-197. 

Constructs 1, 2 and 3 were produced by expressing and purifying portions of the £an_f 
I protein in E. coli using standard techniques. DNA fragments encoding Construct 1 (amino 

5 acids 1 through 65 of £aji_f I) and Construct 2 (amino acids 56 through 1 08) were obtained 
by amplification from a plasmid pETlldA HRHis6£an_f IA wntaining 
full length Caul" I cDNA and were subcloned into an EcoRI/BamHI site of pETl Id vector 
* containing a His6 reporter sequence. A DNA fragment encoding Construct 3 (amino acids 90 
through 148) wasamplified from the same plasmid and subcloned into an EcoRI site of 

10 pETlld vector. Recombinant proteins were affinity purified on a NTA Ni++ chelating resin 
(Qiagen) according to published protocol. 

In order to perform in vitro assays of human T cell proliferative response to £an_f I 
and the five £an_f I peptides described above, whole blood obtained from subjects allergic to 
£anf I in a skin prick test was passed through a Lymphocyte Separation Media (LSM) to 

1 5 remove platelets, red blood cells and granulocytes. The resulting peripheral blood 

mononuclear cells (PBMC) were stimulated with 50ug/ml of recombinant £ajj_f I produced 
as described in Example 2 for 6 days in RPMI 1640 medium supplemented with 5% heat 
inactivated human AB serum, 2mM L-glutamine,10mM HEPES, 50nM 2-mercapto-ethanol 
andl 00 U/ml penicillin and streptomycin. 

20 A second separation LSM was performed to remove high density cell debris and dead 

cells. The resulting PBMC cells were allowed to proliferate for 12-1 8 days. During this time 
the medium was supplemented with additions of recombinant IL2 (5units/ml) and 
IL4 (5units/ml.). 

When the lymphocytes reached a point of rest (determined such that an overnight 
25 pulse of 20,000 cells with *H thymidine was within the 2000 - 4000 CPM range), the cells ■ 
were restimulated for analysis in secondary proliferation assays. Secondary T cell 
proliferation assays included 2x1 04 T cells /well, 5x1 PBMC/ well (irradiated with 3500 
rads) as antigen presenting cells. Antigens were assayed in duplicate or triplicate wells at the 
following concentrations: 
30 rCanf I: 4, 20 and 1 00 ug/ml. 

Esptiiss: 3, 15, and 75 ug/ml. 

nng extract ; 3, 15 and 75 ug of protein/ml. The concentration of £anf I in this 
preparation dog extract is unknown. 

Constructs : initially at a single concentration of 20ug/ml, then at 3, 1 5, and75 ug/ml. 
35 Constructs were insoluble at 75ng/ml. 

PHA was added at 1 ug/ml to indicate nonspecific activation ability. Tetanus toxoid, 
an irrelevant antigen, was added at dilutions of 1:2000, 1:4000 and 1:8000 to indicate T cell 
specificity. 
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After 3 days of culture under conditions of secondary assay luCi of 3 H thymidine 
was added to each culture well for overnight incubation. Cultures were harvested on glass 
fiber filters and 3 H thymidine incorporation was measured by p scintillation counting. 
Stimulation indices which measure the strength of a T cell response to a peptide were 
5 calculated by dividing 3H thymidine uptake of treated cultures by 3 H thymidine uptake of 

untreated medium controls. 

The results of the secondary T cell proliferation assays are shown in Figures 24-25. 
• Figure 24 graphically compares the stimulation indices of individual subjects to r£aall, 
peptides A0095-A0096 and Constructs 1-3. This comparison indicates that significant areas 
10 o£T cell reactivity in the Canfl protein are found at all three parts of the protein, as shown 
« by the substantial stimulation indices of Constructs 1-3 which, together, encompass the entire 

Can f I protein sequence. 

Positivity indices for the peptides shown in Figure 1 0 were calculated by multiplying 
the mean T cell stimulation index (Figure 25) by the percent of the tested individuals who 

15 had a positive response or a T cell stimulation index of at least two. The percentage of 
positive respondent for each tested peptide were as follows: r£anf I: 89%, A0095: 43%, 
A0096: 43%, Construct 1: 64%, Construct 2: 73%, Construct 3: 82%. Comparison of 
positivity indices (Figure 10) which measures both the strength of a T cell response to a 
peptide (S.I.) and the frequency of a T cell response to a peptide in a population of dog 

20 dander allergen sensitive individuals indicates that both N terminal (amino acids 1 through 
65) and C terminal (amino acids 90 through 108) ends of the QmSl protein contain a number 
of T cell epitopes. 

FOTTTVAT.HNTS 

25 Although the invention has been described with reference to its preferred 

embodiments, other embodiments can achieve the same results. Those skilled in the art will 
recognize or be able to ascertain using no more than routine experimentation, numerous 
equivalents to me specific embodiments described herein. Such equivalents are considered to 
be within the scope of this invention and are encompassed by the following claims. 

30 



WO 94/16068 



-47- 



PCT/US93/12468 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: 

(A) NAME: IMMULOGIC PHARMACEUTICAL CORPORATION 

(B) STREET : 610 Lincoln Street 

(C) CITY: Walt ham 
10 (D) STATE: MA 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 02154 

(G) TELEPHONE: (617) 466-6000 

(H) TELEFAX: (617)466-6040 



15 



(ii) TITLE OF INVENTION: Allergenic Protein and Peptides from Dog 

Dander and Uses Therefor 



20 (iii) NUMBER OF SEQUENCES: 109 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

25 (C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: ASCII Text 

(v) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
30 (B) FILING DATE: 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US08/156,549 

(B) FILING DATE: 22-NOV-93 

35 (A) APPLICATION NUMBER: US07/999,712 

(B) FILING DATE: 31 -Dec- 92 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Mandragouras , Amy E. 
40 (B) REGISTRATION NUMBER: 36,207 

(C) REFERENCE /DOCKET NUMBER: IMI-026CPPC 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE; (617) 227-7400 
45 (B) TELEFAX: (617) 227-5941 



(2) INFORMATION FOR SEQ ID NO:l: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
55 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..525 

5 (ix) FEATURE : 

(A) NAME/KEY: matjpeptide 

(B) LOCATION : 79. -525 

10 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

„ ~*n CTC ACC ATC GGC TTC AGC CTC ATT GCG ATC CTG 

£ ™ £ £ £ £ £ «y - ~ U. Ue Ma n. 

-26 -25 " 20 
15 ~ nrl rrr TTG GGA AAG GAC ACT GTG GCT GTG TCA 

S S S°» £ £ » S £ £ "p - ~ - - - 

'--10 " 5 

_ rrr ATG ACA GCA GAC CAG GAG GTG CCT GAG 

20 £ E 5 £ S £ £ £ ~ «p - - - «- Glu 

_ aT p otc AAA GCC CAG AAG GGG GGC 

« "° r £ £ vS £ £ ^ £ 25 £ "a GU> ,ys oly 0!y 

25 Lys Pro Asp Ser vax " 35 
25 

« ,w **r ATC ACC ATG CTG ACA AAT GGT CAG TGC CAG AAC 

£ £ r u s £££«—. - - «• ° i » - 

30 40 45 

„„„ ... - nr TCT GAG CCT GGC AAA TAC ACG GCA 

s s s £ S 2 e s 2 S « «y - *** - ^ 

35 „™ n*n r-rr TTC ATC CAG CCG TCC CCG GTG AGG GAC 

Z £ Z £ S £ £ £ tie «. » « - V* »j ~P 

75 80 
„ rrc gag ctc CAT GGG AGG CAG ATC CGA 

£ Tyr £ £ £ £ £ S «« - - "» £ 

90 95 

ATG G =c «, r s T s ~ « « - s Si £ £ £ 

Met Ala Lys Leu Leu Gly Arg asp ^ 
105 110 

£££££ = = = 2£S; = S: = " 

120 125 

„ m rr TCT rCA GGA GGA CAG TAG 

£ £ £ £ £ £ £ £ S - ™ Gly Gly Gln 

1 Aft x " 



40 



45 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



525 



55 



135 140 
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(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 174 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

10 * (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys Thr Leu Leu Leu Thr He Gly Phe Ser Leu He Ala He Leu 
-26 -25 -20 "15 

15 Gin Ala Gin Asp Thr Pro Ala Leu Gly Lys Asp Thr Val Ala Val Ser 



-10 



-5 



Gly Lys Trp Tyr Leu Lys Ala Met Thr Ala Asp Gin Glu Val Pro Glu 
10 " 20 

20 „ 
Lvs Pro Asp Ser Val Thr Pro Met He Leu Lys Ala Gin Lys Gly Gly 
y 25 30 35 

Asn Leu Glu Ala Lys lie Thr Met Leu Thr Asn Gly Gin Cys Gin Asn 
25 40 45 50 

He Thr Val Val Leu His Lys Thr Ser Glu Pro Gly Lys Tyr Thr Ala 
55 6° 65 70 

30 Tyr Glu Gly Gin Arg Val Val Phe He Gin Pro Ser Pro Val Arg Asp 

75 80 85 

His Tyr He Leu Tyr Cys Glu Gly Glu Leu His Gly Arg Gin He Arg 
90 95 100 

35 

Met Ala Lys Leu Leu Gly Arg Asp Pro Glu Gin Ser Gin Glu Ala Leu 
105 HO 115 

Glu Asp Phe Arg Glu Phe Ser Arg Ala Lys Gly Leu Asn Gin Glu He 
40 120 125 130 

Leu Glu Leu Ala Gin Ser Glu Thr Cys Ser Pro Gly Gly Gin 

135 140 145 

45 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

55 ( V ) FRAGMENT TYPE: internal 



-50- 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Trp Tyr Leu Lys Ala Met Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

15 ~ (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
20 GGGAATTCTG GTAYTTRGCN ATGAC 

(2) INFORMATION FOR SEQ ID NO: 5: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



35 



40 



45 



50 



(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GGGAATTCTG GTAYCTNAAR GCNATGAC 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

Asp Gin Glu Val Pro Glu Lys Pro 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



10 



15 



25 



30 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
GAYCARGARG TNCCDGARAA RCC 23 

(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
( v) FRAGMENT TYPE : internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Tyr lie Leu.Tyr Cys Glu Gly 
l 5 



35 (2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TACATTCTNT AYTGTGARGG 20 

(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
55 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



WO 94/16068 
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(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



5 



40 



Met lie Leu Lys Ala Gin Lys Gly 
1 5 



10 (2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 
15 " (C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: CDNA 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGGGATCCYT TYTGNGCYTT YAADATCAT 
(2) INFORMATION FOR SEQ ID NO: 12: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iv) ANTISENSE: yes 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGGGATCCYT TYTGNGCYTT YAGDATCAT 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: cDNA 

(iv) ANTISENSE: yes 



55 ( x i) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

GGGGATCCYT CACARTAYAA DATRTA 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iv) ANTISENSE: yes 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
15 GGGGATCCYT CACARTANAG DATRTA 26 

(2) INFORMATION FOR SEQ ID NO: 15: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gly Gin Arg Val Val Phe lie 
1 5 



35 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 16; 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : CDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGGCTCGAGG CCAGCGTGTC GTGTTCATC 29 
(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 7 amino acids 
55 (B) TYPE: amino acid 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 
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5 



(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17; 
Gin Pro Ser Pro Val Arg Asp 



1 5 



10 



(2) INFORMATION FOR SEQ ID NO: 18; 



20 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
j5 (c) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GGGCTCGAGC AGCCGTCCCC GGTGAGGGAC 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 7 amino acids 
3Q (b) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
35 (v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
40 Gin Glu Leu Ala Glu Asp Phe 



1 



5 



45 



(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

50 (D) TOPOLOGY: linear / 

(ii) MOLECULE TYPE: cDNA 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

CARGARGCNC TDGARGAYTT 
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10 



15 



20 



35 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iv) ANTI SENSE : yes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GGGGAGATCT CGAGAGGAAG CTGCGGCCGC TGCA 34 

(2) INFORMATION FOR SEQ ID NO: 22 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iv) ANTISENSE: yes 

30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CGAATACGAC TCACTATAGG AAGCTGCGGC CGCTGCAGTA CTTTTTTTTT TTTTTTTT 58 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE : cDNA 

(iv) ANTISENSE: yes. 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

CGAATACGAC TCACTATAGG 20 
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(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

10 

(iv) ANTISENSE: yes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

15 

CCTAGGAGGA AGCTGCGGCC GCTGCA 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

GGGTCTAGAG GTACCGTCCG 

35 (2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
GGGTCTAGAG GTACCGTCCG ATCGATCATT 

50 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



25 
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(ii) MOLECULE TYPE: CDNA 
(iv) ANTISENSE: yes 

(Xi) SEQUENCE DESCRIPTION; SEQ ID NO:27; 
AATGATCGAT GCT 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 
15 (b) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

20 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2B 

25 Lys Trp Tyr Leu Lys Ala Met Thr Ala Asp 
l 5 10 



30 



40 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 29; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
AARTGGTAYC TNAARGCNAT GACAGCAGAC 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 

Gin Glu Val Pro Glu Lys Pro 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iv) ANTI SENSE : yes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GGGGATCCAG GCTTCTCAGG CACCTCCTG 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
30 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 
35 Asp Ser Val Thr Pro Met 



10 



15 



20 



40 



50 



55 



i 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 base pairs 

(B) TYPE: 'nucleic acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : CDNA 

(iv) ANTI SENSE : yes 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 
GGGGATCCAT GGGAGTCACT GAGTC 
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(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
5 (B) TYPE: amino acid 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

10 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

15 Met"Lys Thr Leu Leu Leu Thr lie 
1 5 



20 



(2) INFORMATION FOR SEQ ID NO: 35: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 

CCTCGAGATG AAGACCCTGC TCCTCACCAT C 

35 (2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 J>ase pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
45 (iv) ANT I SENSE : yes 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 
50 GGGAGATCTC AGAGGGTCAT GGAGCTGCTG CCC 



(2) INFORMATION FOR SEQ ID NO: 37: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

CCCTCGAGGA CACTGTGGCT GTGTCAGGGA AATGG 

10 (2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
_ (c) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
CCTGAGAAGC CTGACTCA 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
ACTGCATACG AGGGCCAGCG T 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 
55 GGGAATTCCA GCCGTCCCCG GTGAGGGAC 



20 



25 



30 



35 



40 
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15 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
ATCACCATGC TGACAAATGG T 

(2) INFORMATION FOR SEQ ID NO: 42: 



21 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: CDNA 

(iv) ANTISENSE: yes 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

AATCT CCTGG TTCAATCCTT T 21 

35 (2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 -base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
45 (iv) ANTISENSE: yes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
50 ACGCTGGCCC TCGTATGCAG T / 21 

(2) INFORMATION FOR SEQ ID NO: 44: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
(iv) ANTISENSE: yes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
GGGGATCCAT GGGAGTCACT GAGTC 

* (2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS : 

j 5 _ (a) LENGTH: 21 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: cDNA 

(iv) ANTI SENSE : yes 

25 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

GTGCAGGACC ACCGTGATGT T 

30 (2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
35 (p) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 



40 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Glu Phe Asp Thr Val Ala Val Ser Gly Lys 
5 10 



45 1 



(2) INFORMATION FOR SEQ ID NO: 47: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 



21 



55 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
GGAATTCGAC ACTGTGGCTG TGTCAGGGAA A 31 

5 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 
(A) IiENGTH: 7 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

15 ~~ (v) FRAGMENT TYPE : internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

20 Glu Phe Ser Arg Ala Lys Gly 
1 5 



25 



(2) INFORMATION FOR SEQ ID NO: 49: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

GAGTTCTCAA GAGCCAAAGG AT 22 

40 (2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRAND EDNESS ; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
50 (iv) ANT I SENSE : yes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
55 ATCCTTTGGC TATTGAGAAC TC 22 
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(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 base pairs 
c (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



10 



15 



20 



25 



30 



35 



(ii) MOLECULE TYPE: cDNA 
(iv) ANTISENSE: yes 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
GGGAGATCTA CTGTCCTCCT GGAGAGCAGG T 

} 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 52: 
Thr Cys Ser Pro Gly Gly Gin 



31 



1 5 



40 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
Met Gly His His His His His His Glu Phe Asp Thr Val Ala Val 

c 1U 
1 b 
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15 



25 



30 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
CCCCTCGAGC CACATGAAGA CCCTGCTCCT CACCATC 37 

(2) INFORMATION FOR SEQ ID NO: 55: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



{ i i ) MOLECULE TYPE : pep t ide 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 



Glu Phe Ser Arg Ala Asn Lys Gly Leu Asn Gin Glu lie Leu Glu Leu 
1 5 10 15 

Ala Gin Ser Glu Thr Cys Ser Pro Gly Gly Gin 
35 20 25 

(2) INFORMATION FOR SEQ ID NO: 56: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
GAATTCTCAA GAGCCAAAGG ATTGAACCAG GAGATTTTGG AACTCGCGCA GAGCGAAACC 60 
TGCTCTCCAG GAGGACAG 78 
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10 



15 



25 



30 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
GGGAATTCTC AAGAGCCAAA AGGATT 

(2) INFORMATION FOR SEQ ID NO: 58: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 14 amino acids 

(B), TYPE : amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
<v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Glu Thr Cys Ser Pro Gly Gly Gin His His His His His His 
1 5 10 



35 (2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
45 (iv) ANTISENSE: yes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
50 TTTAGATCTA GTGGTGGTGG TGGTGGTGCT GTCCTCCTGG AGAGCAGGTT TC 

(2) INFORMATION FOR SEQ ID NO: 60: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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10 



(ii) MOLECULE TYPE: cDNA 
(iv) ANTISENSE: yes 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 
TTTAGATCTA GTGGTGGTGG TGG 

(2) INFORMATION FOR SEQ ID NO: 61: 



(i) SEQUENCE CHARACTERISTICS: 
15 ~ (A) LENGTH: 145 amino acids 

(B) TYPE : amino acid 
( D ) TOPOLOGY : 1 inear 



20 



25 



(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Asp Thr Val Ala val Ser Gly Lys Trp Tyr Leu Lys Ala Met Thr Ala 
T 5 10 15 



Asp Gin Glu Val Pro Glu Lys Pro Asp Ser Val Thr Pro Met lie Leu 
30 20 25 30 

Lvs Ala Gin Lys Gly Gly Asn Leu Glu Ala Lys lie Thr Met Leu Thr 
35 40 45 

35 Asn Gly Gin Cys Gin Asn He Thr Val Val Leu His Lys Thr Ser Glu 
50 55 60 

Thr Ala Tyr Glu Gly Gin Arg Val Val Phe He Gin 



70 75 80 



Pro Gly Lys Tyr 
65 
40 

Pro Ser Pro Val Arg Asp Arg Tyr He Leu Tyr Cys Glu Gly Asp Leu 
85 90 95 

Leu Pro Gin Ala His Leu Leu His Pro Ser Cys His His His Ser Leu 
45 100 105 no 

Leu Gin Ala His His. Arg Leu Leu Leu Pro His Lys Lys Leu Leu Gin 
115 120 125 

50 Gly Asp Pro Cys Val Ala Gin Trp Phe Ser Ala Cys Leu Gly Leu Arg 
130 135 140 

Ala 
145 

55 
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(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

10 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
15 asp Thr Val Ala Val Ser Gly I*. Trp Tyr Leu Lys Ala Met Thr Ala 

^ Gin Glu Val Pro Glu Lys Pro Asp Ser Val Thr Pro Met He Leu 

20 25 

20 Lys Ma Oln Lys «y «» « - •» - ~ « ~ 

35 40 



« sly «. cys <*» *s» n. Thr Va! val Leu Kis Lys Thr Ser Glu 
« o" Lys Tyr Thr Ma Tyr O. «Y O- «J *al « »- "° °£ 



65 



30 Pro S.r Pro Val «, «P His Tyr XI. Leu Tyr cys du «y «• - 

8 5 

HU =ly «. XI. «e, M. Lys L.U Leu 01, « £ « «« 

100 

35 G1 „ Ma His His Leu Leu Leu Pro His Lys Lys Leu Leu «- 
115 120 
Mp Pro cys V.1 Ma d» Trp Phe ser M, Cys Leu Oly Leu «9 M- 
40 130 135 

(2) INFORMATION FOR SEQ ID NO: 63: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Pro Glu Lys Pro Asp Ser Val Thr Pro Met lie Leu Lys Ala Gin Lys 

1 5 10 
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Glv Gly Asn Leu Glu Ala Lys He Thr Met Leu Thr Asn Gly Gin Cys 
20 25 30 

Gin Asn He Thr Val Val Leu His Lys Thr Ser Glu Pro Gly Lys Tyr 
5 35 40 45 

Thr Ala Tyr Glu Gly Gin Arg Val Val Phe He Gin Pro Ser Pro Val 
50 55 60 

10 Arg Asp His Tyr He Leu Tyr Cys Glu Gly Glu Leu His Gly Arg Gin 
65 70 75 80 

He Arg Met Ala Lys Gly Leu Asn Gin Glu He Leu Glu Leu Ala Gin 
85 90 95 



15 



30 



45 



55 



Ser Glu Thr Cys Ser Pro Gly Gly Gin 
100 105 



20 (2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 



35 



Met Ala Lys Leu Leu Gly Arg Asp Pro Glu Gin 
! 5 10 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

50 Met Ala Lys Leu Leu Gly Arg Asp Pro Glu Gin Ser Gin Glu Ala Leu 
1 5 10 3.5 



Glu Asp Phe Glu Phe Ser Ala Lys Gly Leu Asn Gin Glu He Leu Glu 
20 25 30 

Leu Ala Gin Ser Glu Thr 
35 



1 
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(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

JO (v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

15 Asp- Pro Glu Gin Ser Glu Glu Ala 
1 5 



20 



(2) INFORMATION FOR SEQ ID NO: 67: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



30 (ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 195.. 734 

(ix) FEATURE: 

35 (a) NAME /KEY: mat_j?eptide 

(B) LOCATION: 253.. 734 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 
40 AGAGCTGGAC CCGTGTGTGT GCTGGCCAAT GAGCCCTGGA GGGTCCGGCT CCAGAGTACC 60 

CTCTTGGCAC AGGGCCGAGT CCATCGGGAC AGATGAACCT AGAGGACTCC ACTGCCCTCC 
45 CATCCACGGG GCCGGGTCAC CAGACTCTGC AAGTCTCCAG CTGTCGCCAA ACCCAGACAG 



AAGGTGCTGT GGAC ATG CAG CTC CTA CTG CTG ACC GTG GGC CTG GCA CTG 
AAGGTGCTGT W, ^ ^ ^ ^ ^ ^ ^ ^ Qly Leu Ala LeU 



55 GGC CTA GAG GAG CTG TCT GGG AGG TGG CAC TCC GTT GCC CTG GCC TCC 
Gly Leu Glu Glu Leu Ser Gly Arg Trp His Ser Val Ala Leu Ala Ser 
10 



120 
180 
230 



-19 -15 -« 

50 ATC TGT GGC CTC CAG GCT CAG GAG GGA AAC CAT GAG GAG CCC CAG GGA 278 
lie Cys Gly Leu Gin Ala Gin Glu Gly Asn His Glu Glu Pro Gin Gly 
-5 1 5 



326 



15 20 25 
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AAC AAG TCC GAT CTG ATC AAA CCC TGG GGG CAC TTC AGG GTT TTC ATC 374 
Asn Lys Ser Asp Leu lie Lys Pro Trp Gly His Phe Arg Val Phe He 
30 35 40 

5 CAC AGC ATG AGC GCA AAG GAC GGC AAC CTG CAC GGG GAT ATC CTT ATA 422 
His Ser Met Ser Ala Lys Asp Gly Asn Leu His Gly Asp He Leu He 
45 50 55 

10 CCG CAG GAC GGC CAG TGC GAG AAA GTC TCC CTC ACT GCG TTC AAG ACT 470 
Pro Gin Asp Gly Gin Cys Glu Lys Val Ser Leu Thr Ala Phe Lys Thr 
60 65 70 

GCC ACC AGC AAC AAA TTT GAC CTG GAG TAC TGG GGA CAC AAT GAC CTG 518 
15 Ala Thr Ser Asn Lys Phe Asp Leu Glu Tyr Trp Gly His Asn Asp Leu 
75 80 85 

TAC CTG GCA GAG GTA GAC CCC AAG AGC TAC CTG ATT CTC TAC ATG ATC 566 
Tyr Leu Ala Glu Val Asp Pro Lys Ser Tyr Leu He Leu Tyr Met He 
20 90 .95 100 105 

AAC CAG TAC AAC GAT GAC ACC AGC CTG GTG GCT CAC TTG ATG GTC CGG 614 
Asn Gin Tyr Asn Asp Asp Thr Ser Leu Val Ala His Leu Met Val Arg 
110 H5 120 



25 



40 



GAC CTC AGC AGG CAG CAG GAC TTC CTG CCG GCA TTC GAA TCT GTA TGT 662 
Asp Leu Ser Arg Gin Gin Asp Phe Leu Pro Ala Phe Glu Ser Val Cys 
125 130 135 



30 GAA GAC ATC GGT CTG CAC AAG GAC CAG ATT GTG GTT CTG AGC GAT GAC 710 
Glu Asp He Gly Leu His. Lys Asp Gin He Val Val Leu Ser Asp Asp 
140 145 150 

GAT CGC TGC CAG GGT TCC AGA GAC TAGGGCCTCA GCCACGCAGA GAG CCAAGCA 764 
35 Asp Arg Cys Gin Gly Ser Arg Asp 
155 160 



GCAGGATCTC ACCTGCCTGA GTACGGT 791 
(2) INFORMATION FOR SEQ ID NO: 68: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 
45 (B) TYPE : amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 

Met Gin Leu Leu Leu Leu Thr Val Gly Leu Ala Leu He Cys Gly Leu 
-19 -15 -10 "5 

55 Gin Ala Gin Glu Gly Asn His Glu Glu Pro Gin Gly Gly Leu Glu Glu 

1 5 10 

Leu Ser Gly Arg Trp His Ser Val Ala Leu Ala Ser Asn Lys Ser Asp 
15 20 25 
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Oly His Phe Arg Val Phe He His Ser Met Ser 
35 40 

in c rlv Asd He Leu He Pro Gin Asp Gly 
5 Ala Lys Asp Gly Asn Leu His Gly Asp H^ 6Q 



Leu He Lys Pro Trp 4Q 
30 



G1 „ cys *. val Ser ^ T ^ Phe ,ys T «. ^ Ser « 

65 

10 T _ Qlu ^ Trp Gly His Asn Asp Leu Tyr Leu Ala Glu 

Lys Phe Asp Leu Glu Tyr i^f i 9Q 



85 

80 



w ^ « ^ « « "« " u Met s: Mn sln *"* Mn 

M p Z ~ ~ Leu Val Hi, « Val « 9 **P ~ « £ 

" no 11 

,1 ov. 0 n« Ser Val Cys Glu Asp He Gly 
20 Gin Gin Asp Phe Leu Pro Ala Phe Glu Ser Val cy ^ 

130 

, rln Ile val val Leu Ser Asp Asp Asp Arg Cys Gin 
His Lys Asp Gin He val vax ^ 



25 



30 



35 



Leu *-jt- — *- icq 

145 



Gly Ser Arg Asp 
160 

(2) INFORMATION FOR SEQ ID NO : 6 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 793 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



40 (ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 3 ,.533 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

- S £25 = z ™ s = s z s = s E 



1 5 



S K 5 = S = = S S i - 2 = = = S 

20 25 

Gly Arg Trp His i»er v 45 
35 * u 



47 



95 



143 



1 
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AAA CCC TGG GGG CAC TTC AGG GTT TTC ATC CAC AGC ATG AGC GCA AAG 191 
Lvs Pro Trp Gly His Phe Arg Val Phe lie His Ser Met Ser Ala Lys 
50 55 60 

5 GAC GGC AAC CTG CAC GGG GAT ATC CTT ATA CCG CAG GAC GGC CAG TGC 23 9 

Asp Gly Asn Leu His Gly Asp He Leu lie Pro Gin Asp Gly Gin Cys 
65 70 75 

GAG AAA GTC TCC CTC ACT GCG TTC AAG ACT GCC ACC AGC AAC AAA TTT 287 
10 Glu Lys Val Ser Leu Thr Ala Phe Lys Thr Ala Thr Ser Asn Lys Phe 
. 80 85 90 95 

GAC CTG GAG TAC TGG GGA CAC AAT GAC CTG TAC CTG GCA GAG GTA GAC 335 
Asp Leu Glu Tyr Trp Gly His Asn Asp Leu Tyr Leu Ala Glu Val Asp 
15 - ioo 105 HO 

CCC AAG AGC TAC CTG ATT CTC TAC ATG ATC AAC CAG TAC AAC GAT GAC 383 
Pro Lys ser Tyr Leu He Leu Tyr Met He Asn Gin Tyr Asn Asp Asp 
115 120 125 

20 

ACC AGC CTG GTG GCT CAC CTG ATG GTC CGG GAC CTC AGC AGG CAG CAG 431 
Thr Ser Leu Val Ala His Leu Met Val Arg Asp Leu Ser Arg Gin Gin 
130 135 140 

25 GAC TTC CTG CCG GCA TTC GAA TCT GTA TGT GAA GAC ATC GGT CTG CAC 479 
Asp Phe Leu Pro Ala Phe Glu Ser Val Cys Glu Asp He Gly Leu His 
3.45 150 155 

AAG GAC CAG ATT GTG GTT CTG AGC GAT GAC GAT CGC TGC CAG GGT TCC 527 
30 Lys Asp Gin He Val Val Leu Ser Asp Asp Asp Arg Cys Gin Gly Ser 
160 165 170 175 

AGA GAC TAGGGCCTCA GCCTACGCAG AGAGCCAAGC AGCAGGATCT CACCTGCCTG 583 
Arg Asp 

AGGACTCAGA C CT AT AGG CT CGGGGGACAC CGTACTCAGC TCTGCGTCCC TCTCTGCGAA 643 

CCCTCCAGGT GATCCCAGCA ACAACACCCA CCTGNGCTTC CATGTGCGGN CCTGTCCAGC 703 

CTGCGCCCAC TCCCTGCCTG GGCAGCCACA CACTCCCCAG CCCCCTGCTA TGGTCCCTCC 763 

TCGCATAATA AAGGACATTC CGTTCAAAAA 793 

(2) INFORMATION FOR SEQ ID NO: 70: 



35 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 amino acids . 
50 (B) TYPE: amino acid / 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Ser Thr Phe Cys Leu Gly Leu Ala Leu He Cys Gly Leu Gin Ala Gin 



1 



5 10 15 



-74- 

Glu Gly Asn His Glu Glu Pro Gin Gly Gly Leu Glu Glu Leu Ser Gly 
20 25 30 

Arg Trp His Ser Val Ala Leu Ala Ser Asn Lys Ser Asp Leu lie Lys 
5 35 40 45 

Pro Trp Gly His Phe Arg Val Phe He His Ser Met Ser Ala Lys Asp 
50 55 60 

10 Gly Asn Leu His Gly Asp He Leu He Pro Gin Asp Gly Gin Cys Glu 
65 70 75 80 

Lys Val Ser Leu Thr Ala Phe Lys Thr Ala Thr Ser Asn Lys Phe Asp 
85 90 95 

15 Leu Glu Tyr Trp Gly His Asn Asp Leu Tyr Leu Ala Glu Val Asp Pro 

XOO 105 <. HO 

Lys Ser Tyr Leu He Leu Tyr Met He Asn Gin Tyr Asn Asp Asp Thr 
20 115 120 125 

Ser Leu Val Ala His Leu Met Val Arg Asp Leu Ser Arg Gin Gin Asp 
130 135 140 

25 Phe Leu Pro Ala Phe Glu Ser Val Cys Glu Asp He Gly Leu His Lys 
145 150 155 160 

Asp Gin He Val Val Leu Ser Asp Asp Asp Arg Cys Gin Gly Ser Arg 
165 170 175 

30 

Asp 



(2) INFORMATION FOR SEQ ID NO: 71: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 774 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES3 : single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



45 ( ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..537 



50 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

CAG CTC CTA CTG CTG ACC GTG GGC CTG GCA CTG ATC TGT GGC CTC CAG 
Gin Leu Leu Leu Leu Thr Val Gly Leu Ala Leu He Cys Gly Leu Gin 
1 5 10 15 

55 

GCT CAG GAG GGA AAC CAT GAG GAG CCC CAG GGA GGC CTA GAG GAG CTG 
Ala Gin Glu Gly Asn His Glu Glu Pro Gin Gly Gly Leu Glu Glu Leu 
20 25 30 



-75- 



TCT GGG AGG TGG CAC TCC GTT GCC CTG GCC TCC AAC AAG TCC GAT CTG 
Ser Gly Arg Trp His Ser Val Ala Leu Ala Ser Asn Lys Ser Asp Leu 



35 



40 45 



ACC AAA CCC TGG GGG CAC TTC AGG GTT TTC ATC CAC AGC ATG AGC GCA 
Thr Lys Pro Trp Gly His Phe Arg Val Phe He His Ser Met Ser Ala 
50 55 60 

AAG GAC GTC AAC CTG CAC GGG GAT ATC CTT ATA CCG CAG GAC GGC CAG 
Lvs Asp Val Asn Leu His Gly Asp He Leu lie Pro Gin Asp Gly Gin 
« 70 75 80 

TGC GAG AAA GTC TCC CTC ACT GCG TTC AAG ACT GCC ACC AGC AAC AAA 
Cys Glu Lys Val Ser Leu Thr Ala Phe Lys Thr Ala Thr Ser Asn Lys 
— 85 90 95 

TTT GAC CTG GAG TAC TGG GGA CAC AAT GAC CTG TAC CTG GCA GAG GTA 
Phe Asp Leu Glu Tyr Trp Gly His Asn Asp Leu Tyr Leu Ala Glu Val 
100 105 HO 

GAC CCC AAG AGC TAC CTG ATT CTC TAC ATG ATC AAC CAG TAC AAC GAT 
Asp Pro Lys Ser Tyr Leu He Leu Tyr Met He Asn Gin Tyr Asn Asp 



X15 



120 125 



GAC ACC AGC CTG GTG GCt CAC CTG ATG GTC CGG GAC CTC AGC AGG CAG 
Asp Thr Ser Leu Val Ala His Leu Met Val Arg Asp Leu Ser Arg Gin 
130 135 140 

CAG GAC TTC CTG CCG GCA TTC GAA TCT GTA TGT GAA GAC ATC GGT CTG 
Gin Asp Phe Leu Pro Ala Phe Glu Ser Val Cys Glu Asp He Gly Leu 
145 150 155 160 

CAC AAG GAC CAG ATT GTG GTT CTG AGC GAT GAC GAT CGC TGC CAG GGT 
His Lys Asp Gin He Val Val Leu Ser Asp Asp Asp Arg Cys Gin Gly 
165 170 175 

TCC AGA GAC TAGGGCCTCA GCCCACGCAG AGAGCCAAGC AGCAGGATCT 
Ser Arg Asp 

CACCTGCCTG AGGACTCAGA CCTATAGGCT CGGGGGACAC CGTACTCAGC TCTGCGTCCC 
TCTCTGCGAA CCCTCCAGGT GATCCCAGCA ACAACACCCA CCTGCGCTTC CATGTGCGGC 
CCTGTCCAGC CTGCGCCCAC TCCCTGCCTG GGCAGCCACA CACTCCCCAG CCCCCTGCTA 



TGGTCCCTCC TCGCATA 



(2) INFORMATION FOR SEQ ID NO: 72: ' / 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 179 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Gin Leu Leu Leu Leu Thr Val Gly Leu Ala Leu lie Cys Gly Leu Gin 
! 5 10 15 

5 Ala Gin Glu Gly Asn His Glu Glu Pro Gin Gly Gly Leu Glu Glu Leu 
20 25 30 

Ser Gly Arg Trp His Ser Val Ala Leu Ala Ser Asn Lys Ser Asp Leu 
10 35 40 45 

' Thr Lys Pro Trp Gly His Phe Arg Val Phe lie His Ser Met Ser Ala 
50 55 60 

i5 Lys Asp Val Asn Leu His Gly Asp lie Leu He Pro Gin Asp Gly Gin 

S". g5 70 75 80 



20 



; *Cys Glu Lys Val Ser Leu Thr Ala Phe Lys Thr Ala Thr Ser Asn Lys 

85 9° 95 

Phe Asp Leu Glu Tyr Trp Gly His Asn Asp Leu Tyr Leu Ala Glu Val 
100 105 HO 



Asp Pro Lys Ser Tyr Leu He Leu Tyr Met He Asn Gin Tyr Asn Asp 
25 H5 120 12 S 

Asp Thr Ser Leu Val Ala His Leu Met Val Arg Asp Leu Ser Arg Gin 
130 135 140 

30 Gin Asp Phe Leu Pro Ala Phe Glu Ser Val Cys Glu Asp He Gly Leu 
145 150 155 160 



35 



40 



His Lys Asp Gin He Val Val Leu Ser Asp Asp Asp Arg Cys Gin Gly 
165 170 175 

Ser Arg Asp 



(2) INFORMATION FOR SEQ ID NO: 73: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 998 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 
45 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO/:73: 
AGAGCTGGAC CCGTGTGTGT GCTGGCCAAT GAGCCCTGGA GGGTCCGGCT CCAGAGTACC 
55 CTCTTGGCAC AGGGCCGAGT CCATCGGGAC AGATGAACCT AGAGGACTCC ACTGCCCTCC 
CATCCACGGG GCCGGGTCAC CAGACTCTGC AAGTCTCCAG CTGTCGCCAA ACCCAGACAG 
AAGGTGCTGT GGACATGCAG CTCCTACWGC ACYKWCYGTC TGGGYYTGGC ACTGATCTGT 



60 
120 
180 
240 
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GGCCTCCAGG CTCAGGAGGG AAACCATGAG GAGCCCCAGG GAGGCCTAGA GGAGCTGTCT 300 

GGGAGGTGGC ACTCCGTTGC CCTGGCCTCC AACAAGTCCG ATCTGAYCAA ACCCTGGGGG 360 

^ CACTTCAGGG TTTTCATCCA CAGCATGAGC GCAAAGGACG KCAACCTGCA CGGGGATATC 420 

CTTATACCGC AGGACGGCCA GTGCGAGAAA GTCTCCCTCA CTGCGTTCAA GACTGCCACC 480 

10 AGCAACAAAT TTGACCTGGA GTACTGGGGA CACAATGACC TGTACCTGGC AGAGGTAGAC 540 

* CCCAAGAGCT ACCTGATTCT CTACATGATC AACCAGTACA ACGATGACAC CAGCCTGGTG 600 

GCTCACYTGA TGGTCCGGGA CCTCAGCAGG CAGCAGGACT TCCTGCCGGC ATTCGAATCT 660 

GTATGTGAAG ACATCGGTCT GCACAAGGAC CAGATTGTGG TTCTGAGCGA TGACGATCGC 720 

TGCCAGGGTT CCAGAGACTA GGGCCTCAGC CYACGCAGAG AGCCAAGCAG CAGGATCTCA 780 

20 CCTGCCTGAG GACTCAGACC TATAGGCTCG GKGGACACCG TACTCAGCTC TGCGTCCCTC 84 0 

TCTGCGAACC CTCCAGGTGA TCCCAGCAAC AACACCCACC TGCGCTTCCA TGTGCGGCCC 900 

TGTCCAGCCT GCGCCCACTC CCTGCCTGGG CAGCCACACA CTCCCCAGCC CCCTGCTATG 960 

GTCCCTCCTC G CAT AAT AAA GGACATTCCG TTCAAAAA 998 
(2) INFORMATION FOR SEQ ID NO: 74: 



15 



25 



30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNES S : single 

(D) TOPOLOGY: linear 



35 



55 



(ii) MOLECULE TYPE: cDNA 



40 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

GGGGGATCCC AGATCGGACT TATTGGAGGC - 30 

(2) INFORMATION FOR SEQ ID NO: 75: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

50 (D) TOPOLOGY: linear • 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
GGGGGATCCG GAGGCCAGGG CAACGGA 27 
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(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

10 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

15 GGGGGATCCA ACGGAGTGCC ACCTCCC 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: CDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

30 

GGGGGAATTC GAGGAGCTGT CTGGGAGGTG G 



(2) INFORMATION FOR SEQ ID NO: 78: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: "linear 

40 

(ii) MOLECULE TYPE: cDNA 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 

GGGGGAATTC AGGTGGCACT CCGTTGCCCT G 
(2) INFORMATION FOR SEQ ID NO: 79: 

50 

(i) SEQUENCE CHARACTERISTICS: / 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
GGGGGAATTC GCCCTGGCCT CCAACAAGTC C 

5 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



15 



35 



50 



(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

20 GGGGGAATTC GAGGGAAACC ATGAGGAGCC 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS : 
25 (A) LENGTH : 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
GGACTTGTTG GAGGCCAGGG C 
(2) INFORMATION FOR SEQ ID NO: 82: 

40 U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
GGGGGAATTC ATCAAACCCT GGGGGCACTT 
(2) INFORMATION FOR SEQ ID NO: 83: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



10 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:83: 
GGGGGATCCA AGTGCCCCCA GGGTTTGAT 
(2) INFORMATION FOR SEQ ID NO: 84: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
15 (c) STRANDEDNESS : single 

~ (D) TOPOLOGY: linear 



20 



25 



30 



35 



(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
CACGGGGATA TCCTTATACC 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
40 GTACAACGAT GACACCAG 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 
45 (a) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

50 (ii) MOLECULE TYPE: cDNA 



55 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
TGTCCCCCGA GCCTAT 
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10 



25 



35 



40 



(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

15 TCCXTCTGCT GACCGTGG 

(2) INFORMATION FOR SEQ ID NO:88: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



55 



(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Glu Gly Asn His Glu Glu Pro Gin Gly Gly Leu Glu Glu Leu Ser Gly 
1 5 10 15 

Arg Trp His Ser Val Ala Leu Ala Ser Xaa Lys Ser Asp Leu He Xaa 
20 25 30 

Pro Trp Gly His Phe Arg 
35 



(2) INFORMATION FOR SEQ ID NO: 89: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 
45 (b) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

50 (v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Met Lys Leu He Leu Leu Leu Leu Cys Leu Gly Leu He Leu Val Cys 
1 5 10 15 
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oxx Gly Bis «. «. «• »« — ~ <"» »~ ° ly r ^ " l 

20. " 

Asp Lys Leu » Giy -P «P - « »- ™ ^ ^ S " ~ W 
«, Glu Lys XI. Glu Glu Asn Gly ser Met Ar g Val Phe H* Gin Bis 

50 55 

Leu Glv Phe Lys Leu Cys He Lys Glu 
10 He Asp Val Leu Glu Asn Ser Leu Giy *- 80 

65 70 



» cly Glu cy. Are Lys - Tyr ser val Ala Tyr Ly. « «- 

85 yu 

100 lu:> 
MO Lys Thr »P Tyr Glu Ar, Tyr Val Met Phe His Leu Val Asn Val 

Asn Asn X Gl» Ma Phe Gin Leu ~ Glu Leu Tyr Gly Ar g Thr Lys 

130 135 
MP Leu ser ser Asp Tie Lys Glu Lys Phe «. Lys Leu Cys Glu Ala 



20 



25 



30 



35 



40 



45 



145 150 

. OT lle A sp Leu Thr Lys Thr Asp Arg 

His Gly He Thr Arg Asp Asn He He Asp u ^ 
165 1/U 

Cys Leu Gin Ala Arg Gly 
180 

(2) INFORMATION FOR SEQ ID NO:90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH! 182 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO:90: 

Met Lys Glx Met Leu Leu Leu Leu Cys Leu Gly Leu He Leu Val Cys 
50 i 5 

.OX val Kis Ala Glu Glu Ma Ser Ser Thr Gly «. Asn ~> 

20 25 



55 



01. Lys lie Asn Gly Glu Trp Hi. Thr He He Leu Al, ser cys Lys 
„ Glu Us Xle Glu Asp Asn Gly Asn Phe „ Leu Phe Leu Glu Gin 



50 55 
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10 



25 



35 



He His Val Leu Glu Asn Ser Leu Val Leu Lys Phe His Thr Val Arg 
65 70 75 80 

Asp Glu Glu Cys Ser Glu Leu Ser Met Val Ala Asp Lys Thr Glu Lys 
85 90 95 

Ala Gly Glu Tyr Ser Val Thr Tyr Asp Gly Phe Asn Thr Phe Thr He 
100 105 HO 

Pro Lys Thr Asp Tyr Asp Asn Phe Leu Met Ala His Leu He Asn Glu 
115 120 125 



Lys Asp Gly Glu Thr Phe Gin Leu Met Gly Leu Tyr Gly Arg Glu Pro 
15 " 130 "5 140 

Asp Leu Met Ser Asp He Lys Glu Arg Phe Ala Gin Leu Cys Glu Glu 
145 150 155 160 

20 His Gly He Leu Arg Glu Asn He He Asp Leu Ser Asn Ala Asn Arg 

165 170 175 



Cys Leu Gin Ala Arg Glu 
1B0 

(2) INFORMATION FOR SEQ ID NO: 91: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

40 GGGAATTCAA TCCATCGAAG GAAG 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: CDNA / 



24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:92: 

55 

GGGAATTCAA TCGATCGAAG GAAG 



24 
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(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



10 



30 



40 



(ii) MOLECULE TYPE: CDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 

15 CGXACGGAAA GATGCTGACC CCAGATCGG 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
CTCTGAAAGA TGCTGACCCC AGATCGG 
(2) INFORMATION FOR SEQ ID NO: 95: 



35 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



45 ( X i) SEQUENCE DESCRIPTION: SEQ ID NO: 95 

GGGTCTAGAG GTACCGTCCG 
(2) INFORMATION FOR SEQ ID NO: 96: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
55 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96 
GGGTCTAGAG GTACCGTCCG ATCGTCGATC ATT 

5 

- (2) INFORMATION FOR SEQ ID NO: 97: " 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

15 — (v) FRAGMENT TYPE: internal 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

20 

Gly Arg Trp His Ser Val Ala Leu Ala Ser Lys Ser Asp 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 98: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
ATCTGTGGCC TCCAGGCTCA GGAGGGAAAC CATGAGGAGC CCCAGGG 
40 (2) INFORMATION FOR SEQ ID NO: 99: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
AAACCCTGGG GGCACTTCAG GGTTTTCATC CACAGCATGA GCGCA 

55 
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(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 35 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

10 

(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 100: 

15 GGGCTCGAGG TCGAGTTTTT TTTTTTTTTT TTGAC 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

30 

CCGGAATTCC GG 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iv) ANTI SENSE : yes 



35 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 
50 AATGATCGAT GCT 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE : cDNA 



10 



20 



35 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
GGGGGATCCC TAGTCTCTGG AACCCTG 
(2) INFORMATION FOR SEQ ID NO: 104: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS : single 
15 — (d) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 ; 
GGGGAATTCG AGGGAAACCA TGAGGAG 
25 (2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 



40 Asp Thr Val Ala Val Ser Gly Lys Trp Tyr Leu Lys Ala Met Thr Ala 
! 5 10 15 



Asp Gin Glu Val Pro Glu Lys Pro Asp Ser Val Thr Pro Met He Leu 
20 25 30 

Lys Ala Gin Lys Gly Gly Asn Leu Glu Ala Lys He Thr Met Leu Thr 
35 40 45 



Asn Gly Gin Cys Gin Asn He Thr Val Val Leu His Lys Thr Ser Glu 
50 50 55 / 60 



Pro 
65 



55 (2) INFORMATION FOR SEQ ID NO:106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 
5 (v) FRAGMENT TYPE: internal 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

Thr val val Leu His Lys Thr Ser Glu Pro Gly Lys Tyr Thr Ala Tyr 

c 10 15 



Glu Gly Gin Arg Val Val Phe He Gin Pro Ser Pro Val Arg Asp His 
15 _ 20 25 30 

Tvr lie Leu Tyr Cys Glu Gly Glu Leu His Gly Arg Gin He Arg Met 
35 40 45 

20 Ala Lys Leu Leu Gly 
50 

(2) INFORMATION FOR SEQ ID NO: 107: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:107: 

Leu Tyr Cys Glu Gly Glu Leu His Gly Arg Gin He Arg Met Ala Lys 
X 5 - 1° 15 

Leu Leu Gly Arg Asp Pro Glu Gin Ser Gin Glu Ala Leu Glu Asp Phe 
20 25 30 

Arg Glu Phe Ser Arg Ala Lys Gly Leu Asn Gin Glu lie Leu Glu Leu 
45 35 40 45 

Ala Gin Ser Glu Thr Cys Ser Pro Gly Gly Gin Xaa 
50 55 60 

50 (2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE : internal 



40 
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20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108 : 

Gly Lys Trp Tyr Leu Lys Ala Met Thr Ala Asp Gin Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

Ala Lys lie Thr Met Leu Thr Asn Gly Gin Cys Gin Asn 
25 i 5 io 
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Wh af ;g Maimed is: 

1 An isolated nucleic acid comprising a nucleotide sequence encoding a peptide 
having an activity of a dog dander allergen, £an_f I. 

5 ~ 

2. An isolated nucleic acid of claim 1 , which is a cDNA sequence. 

3. An isolated nucleic acid of claim 2, wherein the cDNA comprises a nucleotide 
sequence shown in Figure 5 (SEQ ID NO: 1). 

10 . 

4. An isolated nucleic acid of claim 2, wherein the cDNA comprises the coding 

region of a nucleotide sequence shown in Figure 5 (SEQ ID NO: 1). 

5. An isolated nucleic acid of claim 1 , wherein the peptide comprises an amino 
15 acid sequence shown in Figure 5 (SEQ ID NO: 2). 

6. An isolated nucleic acid of claim 5, wherein the peptide comprises amino acid 
residues 1-148 of the sequence shown in Figure 5 (SEQ ID NO: 2). 

20 7. An isolated nucleic acid of claim 1 , wherein the peptide is at least 50% 

homologous with a sequence comprising an amino acid sequence shown in Figure 5 (SEQ ID 
NO: 2). 

8. An isolated nucleic acid of claim 1, wherein the peptide is encoded by a 
25 nucleic acid which hybridizes under high or low stringency conditions to a nucleic acid 

which encodes a peptide comprising an amino acid sequence shown in Figure 5 (SEQ ID 
NO: 2). 

9. An isolated nucleic acid of claim 1, wherein the peptide is at least 10 amino 
30 acids in length. 

1 0. An isolated nucleic acid of claim 1 , wherein the peptide is at least 20 amino 
acids in length. 

35 ii. An isolated nucleic acid of claim 1, wherein the peptide is at least 25 amino 

acids in length. 

1 2. An isolated nucleic acid of claim 1 , wherein the peptide is at least 30 amino 
acids in length. 
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13. An isolated DN A comprising a nucleotide sequence encoding a peptide of at 
least 20 amino acid residues or more in length and having at least about 50% homology with 
an amino acid sequence comprising a sequence shown in Figure 5 (SEQ ID NO: 2). 

14. An isolated nucleic acid comprising a nucleotide sequence encoding a peptide 
having an activity of a dog dander allergen, £an_f II. 

15. An. isolated nucleic acid of claim 1 4, which is a cDN A sequence. 

1 6. An isolated nucleic acid of claim 1 5, wherein the cDNA comprises a 
nucleotide sequence shown in Figure 1 8 (SEQ ID NO: 67). 



17! An isolated nucleic acid of claim 1 5, wherein the cDN A comprises the coding 
1 5 region of a nucleotide sequence shown in Figure 1 8 (SEQ ID NO: 67). 

18. An isolated nucleic acid of claim 14, wherein the peptide comprises an amino 
acid sequence shown in Figure 18 (SEQ ID NO: 68). 

20 19. An isolated nucleic acid of claim 1 8, wherein the peptide comprises amino 

acid residues 1-161 of the sequence shown in Figure 18 (SEQ ID NO: 68). 

20. An isolated nucleic acid of claim 14, wherein the peptide is at least 50% 
homologous with a sequence comprising an amino acid sequence shown in Figure 1 8 (SEQ 

25 ID NO: 68). 

21. An isolated nucleic acid of claim 1 4, wherein the peptide is encoded by a 
nucleic acid which hybridizes under high or low stringency conditions to a nucleic acid 
which encodes a peptide comprising an amino acid sequence shown in Figure 1 8 (SEQ ID 

30 NO: 68). 

22. An isolated nucleic acid of claim 14, wherein the peptide is at least about 10- 
20 amino acids in length. 

35 23. An isolated nucleic acid of claim 1 4, wherein the peptide is at least about 1 0- 

16 amino acids in length. 

24. An isolated DNA comprising a nucleotide sequence encoding a peptide of at 
least about 10-16 amino acid residues or more in length and having at least about 50% 
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homology with an amino acid sequence comprising a sequence shown in Figure 18 (SEQ ID 
NO: 68). 

25. A recombinant expression vector comprising the nucleic acid of claim 1 . 

26. A recombinant expression vector of claim 25, wherein the nucleic acid is 

cDNA. 

27. A recombinant expression vector of claim 26, wherein the cDNA comprises 
10 the-nucleotide sequence shown in Figure 5 (SEQ ID NO: 1). 

i 28. a recombinant expression vector comprising the nucleic acid of claim 14. 

29. A recombinant expression vector of claim 28, wherein the nucleic acid is 

15 cDNA. 

30. A recombinant expression vector of claim 29, wherein the cDNA comprises 
the nucleotide sequence shown in Figure 1 8 (SEQ ID NO: 67). 

31. A host cell transfected with the recombinant expression vector of claim 29 
capable of directing the expression of a peptide having an activity of Caul I. 

32. A host cell of claim 3 1 which is a eukaryotic cell. 

33 A host cell transfected with the recombinant expression vector of claim 27 
capable of directing the expression of a peptide having an activity of £anll. 

34. A host cell of claim 33 which is a eukaryotic cell. 

30 35. A host cell transfected with the recombinant expression vector of claim 28 

capable of directing the expression of a peptide having an activity of Qml H- 

36. A host cell of claim 35 which is a eukaryotic cell. 

35 37 a host cell transfected with the recombinant expression vector of claim 30 

capable of directing the expression of a peptide having an activity of QssS H. 

38. A host cell of claim 37 which is a eukaryotic cell. 



20 



25 
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39. A method of producing a peptide having an activity of £ml h comprising 
culturing a host cell of claim 31 in medium to express the peptide and isolating the peptide 
from the culture. 

5 40. A method of producing a peptide having an activity of CanHI, comprising 

culturing a host cell of claim 35 in medium to express the peptide and isolating the peptide 
from the culture. 

41. An isolated peptide having an activity of a dog dander allergen, Can f I, 
1 0 produced by recombinant expression of a nucleic acid of claim 1 . 

42. An isolated peptide having an activity of a dog dander allergen, Qssil h 
produced by recombinant expression of a nucleic acid of claim 3 . 

15 43 . An isolated peptide having an activity of a dog dander allergen, Csol h 

produced by recombinant expression of a nucleic acid of claim 4. 

44. An isolated peptide having an activity of a dog dander allergen, Cw f I, 
produced by recombinant expression of a DNA of claim 13. 

20 

45. An isolated peptide having an activity of a dog dander allergen, £ml h 
produced by chemical synthesis. 

46- An isolated peptide of claim 45 which is at least about 10-20 amino acids in 

25 length. 

47. An isolated peptide of claim 46 which is at least about 10-16 amino acids in 

length. 

30 48. An isolated peptide having an activity of a dog dander allergen, Canf II, 

produced by recombinant expression of a nucleic acid of claim 14. 

49. An isolated peptide having an activity of a dog dander allergen, Qml U> 
produced by recombinant expression of a nucleic acid of claim 16. 

35 

50. An isolated peptide having an activity of a dog dander allergen, Qml H, 
produced by recombinant expression of a nucleic acid of claim 17. 
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51. An isolated peptide having an activity of a dog dander allergen, CsnJ II, 
produced by recombinant expression of a DNA of claim 24. 

52. An isolated peptide having an activity of a dog dander allergen, £aB_f H, 
produced by chemical synthesis. 

53. An isolated peptide of claim 52 which is at least about 10-20 amino acids in 

length. 

54. An isolated peptide of claim 52 which is at least about 10-16 amino acids in 

length. 

55. A modified peptide having an activity of Caail. 

56 A modified peptide of claim 55, wherein at least one cysteine residue present 
in the Canil amino acid sequence shown in Figure 5 (SEQ ID NO: 2) is replaced by another 
amino acid residue. 



57 A modified peptide of claim 56, wherein at least one cysteine residue present 
20 in the Carif I amino acid sequence shown in Figure 5 (SEQ ID NO: 2) is replaced by a serine 
residue. 

58. A modified peptide of claim 55, wherein at least one lysine residue is added to 
either the amino or carboxy terminus or both the amino and carboxy terminus of the peptide. 



25 



59. A modified peptide having an activity of £m£ H- 



60 A modified peptide of claim 59, wherein at least one cysteine residue present 
in the Caul II amino acid sequence shown in Figure 18 (SEQ ID NO: 68) is replaced by 

30 another amino acid residue. 

61 A modified peptide of claim 60, wherein at least one cysteine residue present 
in the CmS II amino acid sequence shown in Figure. 1 8 (SEQ ID NO: 68) is replaced by a 
serine residue. 



35 



62. A modified peptide of claim 59, wherein at least one lysine residue is added to 
either the amino or carboxy terminus or both the amino and carboxy terminus of the peptide. 
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63 A substantially pure preparation of a peptide having an activity of a dog 
dander allergen, Canf I. 

64. A substantially pure preparation of a peptide having an activity of a dog 
5 dander allergen, £an_f II. 

65. A composition suitable for pharmaceutical adrninistration comprising at least 
* one peptide having an activity of Canll and a pharmaceutical^ acceptable carrier. 

10 ~ 66. A composition of claim 65, wherein the peptide comprises an amino acid 
sequence of Figure 5 (SEQ ID NO: 2). 

67. A composition of claim 66, wherein the peptide comprises amino acid 
residues 1-148 of Figure 5 (SEQ ID NO: 2). 



15 
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68. A composition suitable for pharmaceutical administration comprising at least 
peptide having an activity of £an_f II and a pharmaceutically acceptable carrier. 



69. A composition of claim 68, wherein the peptide comprises an amino acid 
20 sequence of Figure 18 (SEQ ID NO: 68). 

70. A composition of claim 69, wherein the peptide comprises amino acid 
residues 1 -1 61 of Figure 1 8 (SEQ ID NO: 68). 

25 71 . A method of treating sensitivity to a dog dander allergen in a subject sensitive 

to the allergen, comprising administering to the subject the composition of claim 65. 

72. A method of treating sensitivity to a dog dander allergen in a subject sensitive 
to the allergen, comprising administering to the subject the composition of claim 66. 



30 
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73 . A method of detecting sensitivity in a subject to a dog dander allergen, 
comprising combining a blood sample obtained from the subject with a peptide of claim 41, 
under conditions appropriate for binding of blood components with the peptide and 
determining the extent to which such binding occurs. 

74. A method of claim 73, wherein the extent to which binding occurs is 
determined by assessing T cell function, T cell proliferation, B cell function, binding of the 
protein to antibodies present in the blood or a combination thereof. 
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75. A method of treating sensitivity to a dog dander allergen in a subject sensitive 
to the allergen, comprising administering to the subject the composition of claim 68. 

76. A method of treating sensitivity to a dog dander allergen in a subject sensitive 
5 to the allergen, comprising administering to the subject the composition of claim 69. 

77. A method of detecting sensitivity in a subject to a dog dander allergen, 

• comprising combining a blood sample obtained from the subject with a peptide of claim 48, 
under conditions appropriate for binding of blood components with the peptide and 
10 determining the extent to which such binding occurs. 

78. A method of claim 77, wherein the extent to which binding occurs is 
determined by assessing T cell function, T cell proliferation, B cell function, binding of the 
protein to antibodies present in the blood or a combination thereof. 

15 

79. An antibody specifically reactive with a peptide of claim 41 . 

80. An antibody of claim 83 which is a monoclonal antibody. 
20 81. An antibody specifically reactive with a peptide of claim 44. 

82. An antibody of claim 81 which is a monoclonal antibody. 

83 . An antibody specifically reactive with a peptide of claim 48 . 

84. An antibody ofclaim 83 which is a monoclonal antibody. 

85 . An antibody specifically reactive with a peptide of claim 5 1 . 
30 86. An antibody ofclaim 85 which is a monoclonal antibody. 

87. AT cell clone specifically reactive with a peptide of claim 41 . 

88. A soluble T cell receptor specifically reactive with a peptide ofclaim 41. 

89. An antibody specifically reactive with the T cell receptor of claim 88. 

90. The antibody of claim 89 wherein the antibody is a monoclonal antibody. 
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91 . A T cell clone specifically reactive with a peptide of claim 48. 

92. A soluble T cell receptor specifically reactive with a peptide of claim 48. 
5 93. An antibody specifically reactive with the Tcell receptor of claim 92. 

94. The antibody of claim 93 wherein the antibody is a monoclonal antibody. 

95. An isolated peptide having an activity of a dog dander allergen, £anl h which 
10 comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 105, 

SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 108, and SEQ ID NO: 109. 
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* MULTIPLE SEQUENCE ALIGNMENT. * 

Setting of computation parameters 

K-tuple value si 
Gap penalty s 30 
Window size s 5 
Filtering levels 2.5 
Open gap cost s 25 
Unit gap cost s 25 

Setting of other parameters 

Cntricter to show that, a position is well conserved* . 
Alignment 
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Canf2 cDNA CLONING STRATEGY 

poly(A) RNA isolated from dog parotid gland 
Double stranded cDNA 
■PCR using degenerate primers based on Canf 2 sequence 
Agarose gel purify PCR product 
Direct sequencingl>f the PCR fragment 

\ 

Partial nucleotide sequence of Canf 2 

\ . 

Design Canf 2 specific pnmers 
PCR using spedfic primers/cDNA 

I 

Further sequence of Canf 2 cDNA 

\ 

Generate Canf 2 specific probe 

\ , 

Screen dog cDNA library 
Full length Canf 2 cDNA 
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Dog ds cDNA with primer/adaptors 
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1. Sequential rounds of PCR 

2. Sequencing of PCR product 



5* portion of cDNA 
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1. Sequential rounds of PCR 

2. Sequencing of PCR product 
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AP2 5' GGG TCT AGA GGTACC GTC CG 3' 

AT/AL: 5 1 GGG TCT AGAGGT ACC GTC CGA TCG ATC ATT 3' 

TAGC TAG TAAp 5' 
G 
C 
3'-T 

5»- GGG CTC GAG GTC GAG TTT TTT TTT TTT TTT TT (GAC) 
D2 -l: nnnnGATCC CAG ATC GGA CTT ATT GGA GGC-3' 
D2-2: 5*- QGQQG&£££ GGA GGC CAG GGC AAC GGA-2' 
D2-3: nnnGGLTCC AAC GGA GTG CCA CCT CCC-3' 
D2-4: ^ '-fincicidA ATTC GAG GAG CTG TCT GGG AGG TGG-3' 
D2-5: 5'- rraraiAATTC AGG TGG CAC TCC GTT GCC CTG-3' 
D2-6: 5'- rraraAATTC GCC CTG GCC TCC AAC AAG TCC -3' 
D2-9: 5'- r.aaaa A ATTC GAG GGA AAC CAT GAG GAG CC -3* 
D2-11: 5'- GGA CTT GTT GGA GGC CAG GGC -3' 
D2-12: 5'- r.aaaaAATTC ATC AAA CCC TGG GGG CAC TT -3' 
D2-13: 5'- nnnnGATCC AA GTG CCC CCA GGG TTT GAT -3' 
D2-17: 5'- CAC GGG GAT ATC CTT ATA CC -3' 
D2-19: 5'- G TAC AAC GAT GAC ACC AG -3' 
D2-20: 5'- TGT CCC CCG AGC CTA T -3' 
D2-23: 5'- TC CTA CTG CTG ACC GTG G -3' 



i 



WO 94/16068 



19/31 



* -3 

J) 

PCTAJS93/12468 




WO 94/16068 PCTAJS93/12468 

20/31 



AGAGCTGGACCCGT 15 

gS^C^^S^GTCTCCAGCTGTCGCC^CCCAGACAGAAGGTGCTGTGGAC 195 



TGGAGCTCCTACTGCTGACCGTGGGCCTGGCACTGATCTGTGGCCTCCAGGCTCAGGAG 255 

mqlllltvglali cglqaqe 



-19 



GGAAACCA~TGAGGAGCCCCAGGGAGGCCTAGAGGA^ 315 

GG1» EELSGRWH & v 

20 

TTTTCATC 375 



GNHE EPQ- 

10 20 



CTGGCCTCCAACAAGTCCGATCTGATCAAACCCTGGGGGCACITCAGGGTTTTCATC 
LASNKSDLIKPWGHFRV FI 

■an 40 



GCO 
A 

30 



CACAGCATGAGCGCAAAGGACGGCAACCTGCACGGGGATATCCTTATACCGCAGGACGGC 435 
HS MSAKDGNLHGDILIPQ DG 



50 60 



495 



QCEKVSLT AFKTATSNKF DL 

70 80 

GAGTACTGGGGACACAATGACCTGTAGCTGGCAGAGGTAGACCCCAAGAGCTACCTGATT 
EYWGHNDLYLAEVDPKSY L I 



90 100 



CTCTACATGATCAACCAGTACAACGATGACACCAGCCTGGTGGCTCACTTGATGGTCCGG 615 
LYMINQYNDDTSLVAHLMVR 

110 120 



GACCTCAGCAGGCAGCACXSACTTCCTGCCGGCATTC 675 

130 



D L S R Q Q D F L P A F E S V C E D'^G 



CTGCACAAGGACCAGATTGTGGTTCTGAGCGATGACGATCGCTGCCAGGGTTCCAGAGAC 73 5 
LHKDQIVVLSDDDRCQGS RD 

150 160 
GCCACGCAGAGAGCCAAGCAGCAGGATCT.CACCTGCCTGAGTACGGT 792 



TAGGGCCTCAI 
+ 



WO 94/16068 



21/31 



i\& If/2 S 
f Co^f a VS NATIVE Formatted Alignment 



CANF2 NATIVE 



CANF2 NATIVE 



CANF2 NATIVE 



"^UXLTVCL ALICGLQAdE GNHEEPQGGL EELSGRWHSV ALAsklCSDLI 
MQLLLLTVGL »^ gkheepqggL EELSGRWHSV AI*s(foSDLI 



j^SHj^FI HSMSAKDGNL HGDILIPQDG QCEKVSLTAF KTATSNKFDL 
> PWGHFF T 



EYWGHNDLYL AEVDPKSVLI LYMINQYNDD TSLVAHLMVR DLSRQQDFLP 



50 
31 



100 
38 



150 
38 



CANF2 NATIVE 



AFESVCEDIG LHKDQIWLS DDDRCQGSRD 



180 
38 



WO 94/16068 



23/31 

f {(r XI /zsr 



Untitled-2 Formatted Alignment 



Canf 2 
RAT A2U 
MOP 6 MOUSE 



Canf 2 
RAT A2U 
HUP 6 MOUSE 



Canf 2 
RAT A2U 
KUP6_MOUSE 



Canf 2 
RAT A2U 
HUP 6 MOUSE 



LLL1 rv 



31 1 1 J C -GF A 
51 1 1 si C -Vt A 



ihNHEEPQG gixel: 
E^NSERGNL DVD! 
E^SSTGRNF NVEKItt 



FISDbG^RL 



F 4QHIDVLEN 



iJD^GDILIPQ D< 
1 3FKLCIKE HGE 
I/LKFHTVR D: 





48 
49 
48 



98 
99 
98 



148 
149 
148 



180 
181 
180 



0 



WO 94/16068 



24/31 
f^* lift /2 £ 



FCT/US93/12468 



iillsli! liss ill 

■ HHHnHEIIIIIiHUHiiBEIini 




OS* v 



WO 94/16068 



26/31 



PCT/US93/12468 




WO 94/16068 



27/31 



a/c/j 



Formatted Alignment 



la cons 
lc cons 
lj c ns 
Consensus 

la cons 
lc coos 
lj cons 

Consensus 

la cons 
lc cons 
lj cons 

Consensus 

la cons 
lc cons 
lj cons 

Consensus 

la cons 
lc cons 
lj cons 

Consensus 

la cons 
lc cons 
lj cons 

Consensus 

la cons 
lc cons 
lj cons 

Consensus 

la cons 
lc cons 
lj cons 

Consensus 

la cons 
lc cons 
lj cons 

Consensus 



AGAGCTGGAC 



CCg7g7gTGT GCTGGCCAAT GAGCCCTGGA GGGTCCGGCT 



CCAGAGTACC 



CTCXTGGCAC 



AGGGCCGAGT CCATCGGGAC AGATGAACCT 



AGAGGACTCC 



ACTGCCCTCC 



CATCCACGGG GCCGGGTCAC CAGACTCTGC 



50 



50 



100 



100 



150 



150 



AAGTCTCCAG 



C1GTCGCCAA 



ACCCAGACAG AAGGTGCTGT GGACATGCAG 



200 
3 

200 




rr ^^r rTT at accgc__aggacggcca 



450 



WO 94/16068 



a/c/j 



28/31 

Formatted Alignment 



PCT/US93/12468 



la cons 
lc cons 
lj cons 

Consensus 

la cons 
lc cons 
lj cons 

Consensus 

la cons 
lc cons 
lj cons 

Consensus 

la cons 
lc cons 
lj cons 

Consensus 

la cons 
lc cons 
lj cons 

Consensus 

la cons 
lc cons 
lj cons 

Consensus 

la cons 
lc cons 
lj cons 

Consensus 

la cons 
lc cons 
lj cons 

Consensus 

la cons 
lc cons 
lj c ns 

Consensus 



^Tr TrA er r***™** Gft CTGCC ACC AC^AACAAAT TTGACgTGG*| 



^rjrzttGA ^ ^ ^TP^rc T ^rprrrnnr AGAGGTAGAC CCCAAGAGCTl 



irrrffiTTCT r^r^A-rc a apc?agt ac a ACGATGACAC QJKG^TQ^ 



SCTCAdf^ TflBTflrflQGA C ™r*flT*flC CAGCAGGACT TCCTGCCGgq 




P ^ArGCAGAG AfiCCAAGCA fi CAGGATCTCA CCTGCCTGAG 



3ACTCAGACC 



tHtJsgcJcg^cggacaccg TACTCAGCTC TGCGTCCCTC TCTGCGAACC 



CTCCAGGTGA TCCCAGCAAC AACACCCACC TGCGCTTCCA TGTGCGGCCC 



295 
496 
299 

500 



345 
546 
349 

550 



395 
596 
399 

600 

445 
646 
449 

650 

495 
696 
499 

700 



545 
746 
549 

750 



595 
785 
599 

800 



645 
791 
649 

850 



695 
791 
699 

900 



WO 94/16068 



la cons 
lc cons 
13 c ns 
Consensus 

la cons 
lc cons 
lj cons 
Consensus 



f tl , 2W 25 

g/cq Formatted Alignment 



TGTCCAGCCT 



GCGCCCACTC 



CCtGCCTGGG 



C^CCACACA CTCCCCAGCC 



CCCTGCTATG 



GTCCCTCCTC 



GCATAATAAA 



GGACATTCCG 



TTCAAAAA 



745 
791 
749 

950 



793 
791 
774 



996 



WO 94/16068 PCT/US93/12468 



30/31 
fl* Xi, J 2. J 




IS 



WO 94/16068 



31/31 



MEAN STIMULATION INDICES 



iCANFI 




CONST #1 



CONST #2 



CONST #3 



